Moenomycin biosynthesis-related compositions and methods of use thereof

ABSTRACT

The methods and compositions described herein relate to the identification, isolation, and characterization of genes which encode proteins useful for the biosynthesis of transglycosylase inhibitors such as moes. The methods and compositions also relate to the production of such proteins, and their use in the synthesis of moes, the expression of moes, and the production of modified moes.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation of and claims priority under 35 U.S.C. §120 to U.S. application, U.S. Ser. No. 12/377,117, filed May 18, 2010, which is a national stage filing under 35 U.S.C. §371 of international PCT application, PCT/US2007/017999, filed Aug. 13, 2007, which claims priority under 35 U.S.C. §119(e) to U.S. Provisional application, U.S. Ser. No. 60/837,047, filed Aug. 11, 2006, the entire contents of which are incorporated herein by reference.

FIELD OF THE INVENTION

The invention provides polynucleotides and polypeptides related to moe biosynthesis and methods of use thereof. The invention also relates to derivatives of moe A having antibacterial activity.

BACKGROUND

The following discussion of the background of the invention is merely provided to aid the reader in understanding the invention and is not admitted to describe or constitute prior art to the invention.

The bacterial genus Streptomyces is an important natural source of many antibiotics, which include streptomycin, tetracycline, chloramphenicol, macrolides (e.g., erythromycin, carbomycin) and moenomycins (“moes”).

Moes are complex mixtures of phosphoglycolipid compounds produced by many Streptomyces strains as well as other Actinomyces. Streptomyces ederensis, Streptomyces geysiriensis, and Streptomyces bambergiensis (exemplary American Type Culture Collection deposits include ATCC15304, ATCC15303, ATCC13879, respectively) have all been shown to produce moes. See Wallhausser et al., 1965; Lindner et al., 1961. There have also been reports of an unidentified Actinomyces strain which produces compound AC326-alpha, a close relative of one of the moes in the mixture, moe A (He et at, 2000). Additionally, there are reports of Streptomyces strains producing compounds similar to moe A, however the exact chemical structure of these compounds has not yet been established (Weisenborn et al., 1967; Slusarchyk et al., 1969; Takahashi et al., 1970; Meyers et al., 1969).

Although the mixture of moes (e.g., the mixture produced by the strain Streptomyces ghanaensis) has not been thoroughly analyzed, it has been found to contain moe A (FIG. 1) and several other moes, including A₁₂, C₁, C₃ and C₄. Moes A₁₂, C₁, C₃ and C₄ have been shown to represent either shunt products or intermediates of common biosynthetic pathway operating in the producer strain. Additionally, compounds which are thought to be novel moes (Eichhorn, P. et al., 2005; Liu et al., 2003) have also been discovered.

The chemical structure for some moes (e.g., pholipomycin and AC326α) has been established, while the chemical nature of other members of the mixture (e.g., prasinomycins, macarbomycin, teichomycin A1, 11837RP, 8036RP (quebemycin), 19402RP, ensanchomycin, prenomycin) remains to be determined.

Moe A, a major component of the moe mixture, belongs to a unique family of phosphorus-containing secondary metabolites. Moe A is a pentasaccharide decorated with a C₂₅ isoprene chain on one end and a chromophore on the other. The structure of moe A is shown below in Formula I:

Moe A is active against many bacterial strains and is the only antibiotic known to bind directly to and inhibit bacterial transglycosylase (“TG”), enzymes involved in peptidoglycan biosynthesis (FIG. 2). Because peptidoglycan biosynthesis is essential for bacterial survival, the inhibition of transglycosylase is an attractive and as-yet unexploited drug target. Moe A has potent antibiotic activity, with minimum inhibitory concentrations against many Gram positive organisms (“MICs”) in the range of 0.01 to 0.1 μg/mL (Chen L et al., 2003) or greater than 0.1 μg/mL. For example, moe A is an effective inhibitor of cell wall biosynthesis in Gram-positive cocci, including glycopeptide-resistant strains (Goldman, 2000).

It is assumed that the outer membrane of Gram-negative bacteria prevents moe A from reaching the enzymatic target; however, there are several studies showing selective toxicity of moe A and macarbomycin to Gram-negative bacteria carrying conjugative R-plasmids (Iyobe 1973; Ridel 2000). Additionally, some moe A producing Streptomyces strains and various strains not known to produce moe A (or structurally related compounds) are resistant to high concentrations of moe A. Therefore, some general and widely distributed resistance mechanism may exist which is not necessarily associated with moe A biosynthesis. Perhaps some Streptomyces transglycosylases are intrinsically resistant to moe A, or an unusually thick cell wall prevents moe A from reaching its target, or both. Additionally, or alternatively, by analogy to vancomycin resistance genes in S. coelicolor (Hong 2002), there could be a specific, as-yet unidentified moe A resistance gene or gene cluster in Streptomycetes.

The structure-activity relationships of moe A and its derivatives have been studied by Welzel and coworkers. For example, various domains involved in bioactivity and target interactions (Welzel, 1992) have been identified (labeled A through H) (FIG. 1). It has been shown that the C-E-F trisaccharide portion of moe retains inhibitory activity both in vitro and in vivo, while the E-F disaccharide shows activity only in vitro. The phospholipid moiety appears essential for in vivo activity, but the lipid chain can be manipulated to some extent (e.g., hydrogenation of the double bonds does not significantly alter activity). The lipid may be responsible either for anchoring moe to the cell membrane and/or interacting with hydrophobic regions of TGs. Moe analogs containing neryl chains have enzyme inhibitory activity but no biological activity. The carbamoyl group at C3′, the hydroxyl group at C4′, and the carboxamide entity at C5′ of the F ring, as well as the acetyl group at C2′ of the E ring, are all thought to define the moe pharmacophore (Ostash 2005). Unlike many other natural product antibiotics, moe does not contain structural elements of polyketide or non-ribosomal polypeptide origin. However, moes do contain a phosphoglycerate lipid moiety, a structural element not found in any other secondary metabolites. Moe A has been used as a growth promoter in animal feed under the trademark Flavomycin®.

SUMMARY OF THE INVENTION

The present invention provides polynucleotide and/or polypeptide compositions involved in moe biosynthesis and methods of use. For example, polypeptides encoded by nucleic acid sequences such as SEQ ID NOs: 3-25, or fragments, or natural or an artificial variants thereof are contemplated, as are the polypeptides of SEQ ID NOS. 26-48. Compositions including one or more nucleic acid sequences, such as SEQ ID NOs: 3-25, or fragments, or natural or artificial variants thereof are also contemplated. In other embodiments, the nucleic acid sequences or fragments may include an open reading frame. In some embodiments, these nucleic acid compositions may be inserted into expression vectors, and expressed, for example in mammalian, insect, yeast or bacterial cells. Composition comprising one or more polypeptides such as SEQ ID NOs: 26-48, natural or artificial variants, or fragments thereof are also provided.

The methods and compositions also relate to one or more proteins that participate in or are activated for moe biosynthesis. By way of example, but not by way of limitation, these polypeptides may include a composition comprising one or more of the polypeptides selected from the group consisting of: moe A4, moeB4, moeC4, moeB5, moe A5, moeD5, moeJ5, moeE5, moeF5, moeH5, moeK5, moeM5, moeN5, moeO5, moeX5, moeP5, moeR5, moeS5, moeGT1, moeGT2, moeGT3, moeGT4, moeGT5, fragments thereof, or a natural or artificial variant thereof. In one embodiment, the invention provides a composition comprising a one or more of the genes encoding the polypeptides recited above.

The methods and compositions also relate to moe A molecule, derivative or intermediate produced by an organism, such as a bacterial, insect, yeast or mammalian cell, wherein the organism carries one or more mutant or inactivated genes. In some embodiments, the mutant or inactivated gene may be one or more of moe A4, moeB4, moeC4, moeB5, moe A5, moeD5, moeJ5, moeE5, moeF5, moeH5, moeK5, moeM5, moeN5, moeO5, moeX5, moeP5, moeR5, moeS5, moeGT1, moeGT2, moeGT3, moeGT4, and moeGT5. In some embodiments, bacteria may be a Streptomyces strain, such as for example, S. ghanaensis. In other embodiments, the bacteria may be S. ghanaensis ATCC14627, S. lividans TK24, S. albus J1074.

The methods and compositions also relate to enzymatic methods of synthesizing moe, a moe derivative, or a moe intermediate wholly or partially in vitro. In some embodiments, the method includes reacting a one or more moenomycin precursor, derivative and/or moenomycin intermediate with a one or more polypeptide selected from the group consisting of: moeA4, moeB4, moeC4, moeB5, moe A5, moeD5, moeJ5, moeE5, moeF5, moeH5, moeK5, moeM5, moeN5, moeO5, moeX5, moeP5, moeR5, moeS5, moeGT1, moeGT2, moeGT3, moeGT4, and moeGT5, under conditions wherein the moenomycin, the moenomycin derivative, or the intermediate is wholly or partially synthesized. In some embodiments of the method, the method further comprises reacting the moenomycin, moenomycin derivative and/or moenomycin intermediate with a one or more reactants selected from the group consisting of: UDP-sugars, prenyl-pyrophosphates, phosphoglycerate, amino acids, cabamoyl phosphate, ATP and biological cofactors.

The methods and compositions also enzymatic of modifying a moenomycin wholly or partially in vitro. In some embodiments, the method includes reacting a moenomycin, a moenomycin derivative or a moe intermediate with a one or more polypeptide selected from the group consisting of: moeA4, moeB4, moeC4, moeB5, moe A5, moeD5, moeJ5, moeE5, moeF5, moeH5, moeK5, moeM5, moeN5, moeO5, moeX5, moeP5, moeR5, moeS5, moeGT1, moeGT2, moeGT3, moeGT4, and moeGT5, under conditions wherein the moenomycin, the moenomycin derivative, or the intermediate is modified. In some embodiments, the method further comprises reacting the moenomycin, moenomycin derivatives and/or moenomycin intermediates with a one or more reactants selected from the group consisting of: UDP-sugars, prenyl-pyrophosphates, phosphoglycerate, amino acids, carbamoyl phosphate, ATP and biological cofactors.

The methods and compositions described herein also relate to pharmaceutically acceptable compositions of moes, and the treatment of mammals, such as humans, by administering such compositions in a therapeutically effective amount. In some embodiments, the pharmaceutical composition may include a moe synthesized or modified by the methods of the present disclosure.

In another embodiment, the present invention provides an isolated Streptomyces strain selected from the group consisting of: Streptomyces ghanaensis, Streptomyces ederensis, Streptomyces geysiriensis, and Streptomyces bambergiensis strain which carries a one or more mutant or inactivated genes, wherein the mutant or inactivated genes are selected from the group consisting of: moeA4, moeB4, moeC4, moeB5, moe A5, moeD5, moeJ5, moeE5, moeF5, moeH5, moeK5, moeM5, moeN5, moeO5, moeX5, moeP5, moeR5, moeS5, moeGT1, moeGT2, moeGT3, moeGT4, and moeGT5. For example, the Streptomyces ghanaensis strain may be Streptomyces ghanaensis ATCC14627.

In another embodiment, the present invention relates to an isolated recombinant cell expressing one or more polypeptides or fragments, or natural or artificial variants thereof selected from the group consisting of: moeA4, moeB4, moeC4, moeB5, moeA5, moeD5, moeJ5, moeE5, moeF5, moeH5, moeK5, moeM5, moeN5, moeO5, moeX5, moeP5, moeR5, moeS5, moeGT1, moeGT2, moeGT3, moeGT4, and moeGT5. In some embodiments, the cell is selected from the group consisting of: Streptomyces lividans TK24, E. coli, mammalian cells, yeast and insect cells.

In another embodiment, the compositions described herein further relate to a moenomycin A molecule derivative or intermediate produced by an isolated recombinant cell expressing one or more polypeptides or fragments, or natural or artificial variants thereof selected from the group consisting of: moeA4 moeB4, moeC4, moeB5, moe A5, moeD5, moeJ5, moeE5, moeF5, moeH5, moeK5, moeM5, moeN5, moeO5, moeX5, moeP5, moeR5, moeS5, moeGT1, moeGT2, moeGT3, moeGT4, and moeGT5.

In one embodiment, the present invention provides a moenomycin derivative having the structure:

wherein Ac refers to acetyl;

R and R¹ independently are selected from the group consisting of hydroxyl and —NHR² where R² is selected from the group consisting of hydrogen, alkyl, cycloalkyl, and substituted cycloalkyl;

X is hydrogen, or

where R³ is selected from the group consisting of hydrogen and hydroxyl; and

X¹ is hydrogen,

where R⁴ is selected from the group consisting of hydrogen and hydroxyl;

R⁵ is selected from the group consisting of hydroxyl and —NHR⁶ where R⁶ is hydrogen, alkyl, cycloalkyl, or substituted cycloalkyl, and

R⁷ is hydrogen or methyl,

or a pharmaceutically acceptable salt, tautomer, and/or ester thereof.

In some embodiments, R and R¹ independently are —NH₂. In some embodiments, X is hydrogen. In other embodiments, X is

where R³ is selected from the group consisting of hydrogen and hydroxyl.

In some embodiments, X¹ is hydrogen. In other embodiments, X¹ is

In still other embodiments, X¹ is

where R⁴ is selected from the group consisting of hydrogen and hydroxyl and R⁵ is selected from the group consisting of hydroxyl and —NH₂.

In some embodiments, the structure of the moenomycin derivative is:

where R³ is selected from the group consisting of hydrogen and hydroxyl,

or a pharmaceutically acceptable salt, tautomer, and/or ester thereof.

In some embodiments, R³ is hydrogen. In other embodiments, R³ is hydroxyl.

In some embodiments, the structure of the moenomycin derivative is:

where R⁴ is selected from the group consisting of hydrogen and hydroxyl,

or a pharmaceutically acceptable salt, tautomer, and/or ester thereof.

In some embodiments, R⁴ is hydrogen. In other embodiments, R⁴ is hydroxyl.

In some embodiments, the structure of the moenomycin derivative is:

where R⁴ is selected from the group consisting of hydrogen and hydroxyl,

or a pharmaceutically acceptable salt, tautomer, and/or ester thereof.

In some embodiments, R⁴ is hydrogen. In other embodiments, R⁴ is hydroxyl.

In some embodiments, the structure of the moenomycin derivative is:

where R⁴ is selected from the group consisting of hydrogen and hydroxyl,

or a pharmaceutically acceptable salt, tautomer, and/or ester thereof.

In some embodiments, R⁴ is hydrogen. In other embodiments, R⁴ is hydroxyl.

In some embodiments, the structure of the moenomycin derivative is:

where R⁴ is selected from the group consisting of hydrogen and hydroxyl,

or a pharmaceutically acceptable salt, tautomer, and/or ester thereof.

In some embodiments R⁴ is hydrogen. In other embodiments, R⁴ is hydroxyl.

In some embodiments, the structure of the moenomycin derivative is:

wherein R⁴ is hydrogen or hydroxyl and R⁶ is selected from the group consisting of hydrogen, alkyl, cycloalkyl, and substituted cycloalkyl,

or a pharmaceutically acceptable salt, tautomer, and/or ester thereof.

In some embodiments, R⁴ is hydrogen. In other embodiments, R⁴ is hydroxyl.

In some embodiments, R⁶ is hydrogen or substituted cycloalkyl. In some preferred embodiments, substituted cycloalkyl is

In some embodiments, R⁴ is hydroxyl and R⁶ is

In some embodiments, the invention provides a pharmaceutical composition comprising the moenomycin derivative as defined above and a pharmaceutically acceptable carrier.

In another embodiment, the present invention provides a moenomycin derivative having the structure:

wherein

R⁷ and R⁸ independently are selected from the group consisting of hydroxyl and —NHR⁹ where R⁹ is hydrogen, alkyl, cycloalkyl, or substituted cycloalkyl; and

R¹⁰ is hydrogen or hydroxyl;

or a pharmaceutically acceptable salt, tautomer, and/or ester thereof.

In some embodiments, R⁷ and R⁸ independently are —NH₂. In some embodiments, R¹⁰ is hydroxyl.

In some embodiments, the structure of the moenomycin derivative is:

or a pharmaceutically acceptable salt, tautomer, and/or ester thereof.

In some embodiments, invention provides a pharmaceutical composition comprising the moenomycin derivative as defined above and a pharmaceutically acceptable carrier.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 shows the structure of moe A with the domains A-H involved in bioactivity and target interactions indicated in capital letters.

FIG. 2 shows interaction of moe A with the targets on the cell membrane.

FIGS. 3A-3B show schematics of the moe gene clusters 1 (FIG. 3A) and 2 (FIG. 3B), and the relative positions of the identified ORFs and genes along the clusters.

FIGS. 4A-4D show schematics of the moe A biosynthetic pathway. Dotted arrow line represents multiple biosynthetic steps (omitted on the scheme) leading from compound 8 to 22/23

FIGS. 5A-5B show Southern blots demonstrating the integration of vectors pSET52 (FIG. 5A) and pSOK804 (FIG. 5B) in the S. ghanaensis genome.

FIG. 6 shows a schematic for the generation of a conjugative plasmid that may be used for insertional gene inactivation.

FIG. 7 shows a schematic describing an approach for in silico screening involving classes of enzymes directed to producing different domains of moes.

FIGS. 8A-8B show schematics for the possible role of the moeO5 protein in moe biosynthesis. FIG. 8A shows biosynthesis of intermediate of archael membrane lipid. FIG. 8B shows biosynthesis of phosphoglyceric acid incorporation into moenomycin.

FIGS. 9A-9B show schematics for the insertional inactivation of the moeM5 gene (FIG. 9B) and confirmation of specific integration of the disruption sequences by Southern analysis (FIG. 9A).

FIGS. 10A-10B show schematics of the insertional inactivation of the moeGT1 gene (FIG. 10B) and confirmation of specific integration of the disruption sequences by Southern analysis (FIG. 10A).

FIGS. 11A-11D show bioassays (FIG. 11A), BioTLC (FIG. 11B) and liquid chromatography and mass spectrometry (“LC-MS”) data (FIG. 11C and FIG. 11D) indicating that in the moeGT1 mutant, moe A production appears to be reduced or abolished.

FIGS. 12A-12D show bioassays (FIG. 12A), BioTLC (FIG. 12B) and LC-MS data (FIG. 12C and FIG. 12D) of the moe A analog in the moeM5 mutant.

FIGS. 13A-13B show graphs of LC-MS analysis of moe extracts from wild-type (FIG. 13A) and moeM5 deficient strain OB20a (FIG. 13B).

FIG. 14 shows a bioassay of methanol extracts from 2 g of mycelia of strains S. lividans J1725 38-1⁺ (2) and S. lividans J1725 38-1⁺ plJ584⁺ (3). (1)—standard (moe A, 4 mcg).

FIG. 15A shows a Southern analysis of BamHI and XhoI digests of total DNA of wild type S. ghanaensis (lanes 2 and 4, respectively) and MO12 strain with disrupted moeGT3 (lanes 3 and 5). Lane 1—mixture of plasmids pMO12, pMO14 and pOOB58 underdigested with PstI. FIG. 15B shows the results of a bioassay of semipurified extracts from 1 g (wet weight) of mycelia of wild type strain (WT) and MO12. FIG. 15C presents a scheme of moeGT3 disruption in the S. ghanaensis genome. X, H, E mark XhoI, HindIII, EcoRI sites, respectively.

FIGS. 16A-16B show graphs of LC-MS analysis of moenomycin metabolites accumulated by S. ghanaensis MO12 strain. The final product is moenomycin C4 (1m) having Rt 9.2 min (FIG. 16A). The strain also accumulates its precursor lacking chromophore unit (2m; Rt 10.0 min) (FIG. 16B). Peaks corresponding to trisaccharide and disaccharide precursors of moenomycin C4 (3m and 4m, respectively) are observed. 5m is decarbamoylated derivative of 4m. 2m(dc) and 1m(dc) are doubly charged ions of 2m and 1m, respectively.

FIGS. 17A-17B show graphs of LC-MS analysis of moenomycin metabolites accumulated by S. lividans TK24 ΔmoeN5 strain (FIG. 17A). The final product is compound 23 having Rt 4.2 min (FIG. 17B). The strain also accumulates its monosaccharide precursors 2 and 3 (Rt 4.7-4.8 min). Structures of compounds 2, 3, 4, 8 and 23 are shown on FIGS. 4A-4D. Compounds 5 and 25d are decarbamoylated derivatives of 8 and 25, respectively. 23(dc) is doubly charged ion of 23.

FIGS. 18A-18B show LC-MS spectra of Mixed (−)-ESI-MS2 spectrum of compounds 22 and 23 produced by the S. lividans ΔmoeN5 strain (FIG. 18A) and the proposed fragmentation pathway of the compounds (FIG. 18B).

FIG. 19 shows the results of a disc diffusion assay of antibacterial activity of moe a intermediates against B. cereus. Spots 1 and 2—moe A (100 and 10 nM per disc, respectively); 3—compound 15 (100 nM); 4—compound 16 (100 nM), 5—compound 17 (100 nM), 6—compound 24 (100 nM), 7—mixture of compounds 22 and 23 (200 nM), 8—compound 11 (100 nM), 9—mixture of compounds 22 and 23 (50 nM), 10—mixture of compounds 2 and 3 (200 nM), 11—compound 1 (200 nM), 12—extract from 5 g S. lividans TK24 mycelial cake.

DETAILED DESCRIPTION Definitions

The definitions of certain terms as used in this specification are provided below.

As used herein, the “administration” of an agent or drug to a subject or subject includes any route of introducing or delivering to a subject a compound to perform its intended function. Administration can be carried out by any suitable route, including orally, intranasally, parenterally (intravenously, intramuscularly, intraperitoneally, or subcutaneously), rectally, or topically. Administration includes self-administration and the administration by another. It is also to be appreciated that the various modes of treatment or prevention of medical conditions as described are intended to mean “substantial”, which includes total but also less than total treatment or prevention, and wherein some biologically or medically relevant result is achieved.

As used herein, the term “alkyl” refers to monovalent saturated aliphatic hydrocarbyl groups having from 1 to 6 carbon atoms. This term includes, by way of example, linear and branched hydrocarbyl groups such as methyl (CH₃—), ethyl (CH₃CH₂—), n-propyl (CH₃CH₂CH₂—), isopropyl ((CH₃)₂CH—), n-butyl (CH₃CH₂CH₂CH₂—), isobutyl ((CH₃)₂CHCH₂—), sec-butyl ((CH₃)(CH₃CH₂)CH—), t-butyl ((CH₃)₃C—), n-pentyl (CH₃CH₂CH₂CH₂CH₂—), and neopentyl ((CH₃)₃CCH₂—).

As used herein, the term “amino” refers to the group —NH₂.

As used herein, the term “amino acid” includes naturally-occurring amino acids and synthetic amino acids, as well as amino acid analogs and amino acid mimetics that function in a manner similar to the naturally-occurring amino acids. Naturally-occurring amino acids are those encoded by the genetic code, as well as those amino acids that are later modified, e.g., hydroxyproline, γ-carboxyglutamate, and O-phosphoserine. Amino acid analogs refers to compounds that have the same basic chemical structure as a naturally-occurring amino acid, i.e., an α-carbon that is bound to a hydrogen, a carboxyl group, an amino group, and an R group, e.g., homoserine, norleucine, methionine sulfoxide, methionine methyl sulfonium. Such analogs have modified R groups (e.g., norleucine) or modified peptide backbones, but retain the same basic chemical structure as a naturally-occurring amino acid. Amino acid mimetics refers to chemical compounds that have a structure that is different from the general chemical structure of an amino acid, but that functions in a manner similar to a naturally-occurring amino acid. Amino acids can be referred to herein by either their commonly known three letter symbols or by the one-letter symbols recommended by the IUPAC-IUB Biochemical Nomenclature Commission. Nucleotides, likewise, can be referred to by their commonly accepted single-letter codes.

As used herein, “cycloalkyl” refers to a saturated or partially saturated cyclic group of 5 carbon atoms and no ring heteroatoms. The term “cycloalkyl” includes cycloalkenyl groups. Example of cycloalkyl group includes, for instance, cyclopentyl or cyclopentenyl.

As used herein, “substituted cycloalkyl” refers to a cycloalkyl group, as defined herein, having from 1 to 3 substituents selected from the group consisting of oxo and hydroxy, wherein said substituents are as defined herein. The term “substituted cycloalkyl” includes substituted cycloalkenyl groups.

As used herein, the term “ester” include formate, acetate, propionate, butyrate, acrylate, and ethylsuccinate derivatives of the compounds of the invention.

As used herein, “expression” includes but is not limited to one or more of the following: transcription of the gene into precursor mRNA; splicing and other processing of the precursor mRNA to produce mature mRNA; mRNA stability; translation of the mature mRNA into protein (including codon usage and tRNA availability); and glycosylation and/or other modifications of the translation product, if required for proper expression and function.

A “gene” includes a polynucleotide containing at least one open reading frame that is capable of encoding a particular polypeptide or protein after being transcribed and translated. Any of the polynucleotide sequences described herein may be used to identify larger fragments or full-length coding sequences of the gene with which they are associated. Methods of isolating larger fragment sequences are known to those of skill in the art, some of which are described herein.

A “gene product” includes an amino acid (e.g., peptide or polypeptide) generated when a gene is transcribed and translated.

As used herein, “hybridization” includes a reaction in which one or more polynucleotides react to form a complex that is stabilized via hydrogen bonding between the bases of the nucleotide residues. The hydrogen bonding may occur by Watson-Crick base pairing, Hoogstein binding, or in any other sequence-specific manner. The complex may comprise two strands forming a duplex structure, three or more strands forming a multi-stranded complex, a single self-hybridizing strand, or any combination of these. A hybridization reaction may constitute a step in a more extensive process, such as the initiation of a PCR reaction, or the enzymatic cleavage of a polynucleotide by a ribozyme.

Hybridization reactions can be performed under conditions of different “stringency”. The stringency of a hybridization reaction includes the difficulty with which any two nucleic acid molecules will hybridize to one another. Under stringent conditions, nucleic acid molecules at least 60%, 65%, 70%, 75% identical to each other remain hybridized to each other, whereas molecules with low percent identity cannot remain hybridized. A preferred, non-limiting example of highly stringent hybridization conditions are hybridization in 6× sodium chloride/sodium citrate (SSC) at about 45° C., followed by one or more washes in 0.2×SSC, 0.1% SDS at 50° C., preferably at 55° C., more preferably at 60° C., and even more preferably at 65° C.

When hybridization occurs in an antiparallel configuration between two single-stranded polynucleotides, the reaction is called “annealing” and those polynucleotides are described as “complementary”. A double-stranded polynucleotide can be “complementary” or “homologous” to another polynucleotide, if hybridization can occur between one of the strands of the first polynucleotide and the second. “Complementarity” or “homology” (the degree that one polynucleotide is complementary with another) is quantifiable in terms of the proportion of bases in opposing strands that are expected to hydrogen bond with each other, according to generally accepted base-pairing rules.

As used herein, “hydroxy” or “hydroxyl” refers to the group —OH.

As used herein, the terms “identical” or percent “identity”, when used in the context of two or more nucleic acids or polypeptide sequences, refers to two or more sequences or subsequences that are the same or have a specified percentage of amino acid residues or nucleotides that are the same (i.e., about 60% identity, preferably 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or higher identity over a specified region (e.g., nucleotide sequence encoding a moeA biosynthetic polypeptide of the invention as described herein or amino acid sequence of an moeA biosynthetic polypeptide described herein), when compared and aligned for maximum correspondence over a comparison window or designated region) as measured using a BLAST or BLAST 2.0 sequence comparison algorithms with default parameters described below, or by manual alignment and visual inspection (see, e.g., NCBI web site). Such sequences are then said to be “substantially identical.” This term also refers to, or can be applied to, the compliment of a test sequence. The term also includes sequences that have deletions and/or additions, as well as those that have substitutions. As described below, the preferred algorithms can account for gaps and the like. Preferably, identity exists over a region that is at least about 25 amino acids or nucleotides in length, or more preferably over a region that is 50-100 amino acids or nucleotides in length.

An “isolated” or “purified” polypeptide or biologically-active portion thereof is substantially free of cellular material or other contaminating polypeptide or non-moeA-related agents from the cell from which the is derived, or substantially free from chemical precursors or other chemicals when chemically synthesized.

As used herein, the term “nucleotide pair” means the two nucleotides bound to each other between the two nucleotide strands.

As used herein, the term “oxo” refers to the atom (═O).

As used herein, the term “pharmaceutically-acceptable carrier” is intended to include any and all solvents, dispersion media, coatings, antibacterial and antifungal compounds, isotonic and absorption delaying compounds, and the like, compatible with pharmaceutical administration.

As used herein, the term “polynucleotide” means any RNA or DNA, which may be unmodified or modified RNA or DNA. Polynucleotides include, without limitation, single- and double-stranded DNA, DNA that is a mixture of single- and double-stranded regions, single- and double-stranded RNA, RNA that is mixture of single- and double-stranded regions, and hybrid molecules comprising DNA and RNA that may be single-stranded or, more typically, double-stranded or a mixture of single- and double-stranded regions. In addition, polynucleotide refers to triple-stranded regions comprising RNA or DNA or both RNA and DNA. The term polynucleotide also includes DNAs or RNAs containing one or more modified bases and DNAs or RNAs with backbones modified for stability or for other reasons. In a particular embodiment, the polynucleotide contains polynucleotide sequences from a moeA biosynthetic gene of the invention.

As used herein, the terms “polypeptide”, “peptide” and “protein” are used interchangeably herein to mean a polymer comprising two or more amino acids joined to each other by peptide bonds or modified peptide bonds, i.e., peptide isosteres. Polypeptide refers to both short chains, commonly referred to as peptides, glycopeptides or oligomers, and to longer chains, generally referred to as proteins. Polypeptides may contain amino acids other than the 20 gene-encoded amino acids. Polypeptides include amino acid sequences modified either by natural processes, such as post-translational processing, or by chemical modification techniques that are well known in the art. Such modifications are well described in basic texts and in more detailed monographs, as well as in a voluminous research literature. In a particular embodiment, the polypeptide contains polypeptide sequences from a polypeptide encoded by a moeA biosynthetic gene of the invention.

As used herein, the term “recombinant” when used with reference, e.g., to a cell, or nucleic acid, protein, or vector, indicates that the cell, nucleic acid, protein or vector, has been modified by the introduction of a heterologous nucleic acid or protein or the alteration of a native nucleic acid or protein, or that the material is derived from a cell so modified. Thus, e.g., recombinant cells express genes that are not found within the native (non-recombinant) form of the cell or express native genes that are otherwise abnormally expressed, under expressed or not expressed at all.

As used herein, the term “small molecule” means a composition that has a molecular weight of less than about 5 kDa and more preferably less than about 2 kDa. Small molecules can be, e.g., nucleic acids, peptides, polypeptides, glycopeptides, peptidomimetics, carbohydrates, lipids, lipopolysaccharides, combinations of these, or other organic or inorganic molecules.

As used herein, the term “subject” means that preferably the subject is a mammal, such as a human, but can also be an animal, e.g., domestic animals (e.g., dogs, cats and the like), farm animals (e.g., cows, sheep, pigs, horses and the like) and laboratory animals (e.g., monkey, rats, mice, rabbits, guinea pigs and the like).

As used herein, the term “substitution” is one of mutations that is generally used in the art. Those substitution variants have at least one amino acid residue in the polypeptides encoded by the moeA biosynthetic genes of the invention replaced by a different residue. The sites of greatest interest for substitutional mutagenesis include the active sites or regulatory regions of the polypeptides encoded by the moeA biosynthetic genes of the invention. “Conservative substitutions” are shown in the Table below under the heading of “preferred substitutions”. If such substitutions result in a change in biological activity, then more substantial changes, denominated “exemplary substitutions” in Table 1, or as further described below in reference to amino acid classes, may be introduced and the products screened.

TABLE 1 Amino Acid Substitutions Preferred Original Residue Exemplary Substitutions Substitutions Ala (A) val; leu; ile val Arg (R) lys; gln; asn lys Asn (N) gln; his; asp, lys; arg gln Asp (D) glu; asn glu Cys (C) ser; ala ser Gln (Q) asn; gln asn Glu (E) asp; gln asp Gly (G) ala ala His (H) asn; gln; lys; arg arg Ile (I) leu; val; met; ala; phe; norleucine leu Leu (L) norleucine; ile; val; met; ala; phe ile Lys (K) arg; gln; asn arg Met (M) leu; phe; ile leu Phe (F) leu; val; ile; ala; tyr tyr Pro P) ala ala Ser (S) thr thr Thr (T) ser ser Trp (W) tyr; phe tyr Tyr (Y) trp; phe; thr; ser phe Val (V) ile; leu; met; phe; ala; norleucine leu

Once such variants are generated, the panel of variants is subjected to screening as described herein and variants with similar or superior properties in one or more relevant assays may be selected for further development.

As used herein, the term “effective amount” or “pharmaceutically effective amount” or “therapeutically effective amount” of a composition, is a quantity sufficient to achieve a desired therapeutic and/or prophylactic effect, e.g., an amount which results in the prevention of, or a decrease in, the symptoms associated with a disease or condition that is being treated, e.g., the conditions associated with a bacterial infection. The amount of a composition of the invention administered to the subject will depend on the type and severity of the disease and on the characteristics of the individual, such as general health, age, sex, body weight and tolerance to drugs. It will also depend on the degree, severity and type of disease. The skilled artisan will be able to determine appropriate dosages depending on these and other factors. The compositions of the present invention can also be administered in combination with one or more additional therapeutic compounds. For example, a “therapeutically effective amount” of moe A derivatives is meant levels in which effects of bacterial infection are, at a minimum, ameliorated.

As used herein, the term “pharmaceutically acceptable salt” refers to pharmaceutically acceptable salts derived from a variety of organic and inorganic counter ions well known in the art and include, by way of example only, sodium, potassium, calcium, magnesium, ammonium, and tetraalkylammonium, and when the molecule contains a basic functionality, salts of organic or inorganic acids, such as hydrochloride, hydrobromide, tartrate, mesylate, acetate, maleate, and oxalate. Suitable salts include those described in P. Heinrich Stahl, Camille G. Wermuth (Eds.), Handbook of Pharmaceutical Salts Properties, Selection, and Use; 2002.

As used herein, the term “tautomer” refers to alternate forms of a compound that differ in the position of a proton, such as enol-keto and imine-enamine tautomers.

As used herein, the terms “treating” or “treatment” or “alleviation” refers to both therapeutic treatment and prophylactic or preventative measures, wherein the object is to prevent or slow down (lessen) the targeted pathologic condition or disorder. A subject is successfully “treated” for a disorder characterized by bacterial infection if after receiving a therapeutic amount of a moe A derivative according to the methods of the present invention, the subject shows observable and/or measurable reduction in or absence of one or more signs and symptoms of a particular disease or condition. For example, for infection, inhibition (i.e., slow to some extent and preferably stop) of bacterial growth; and/or relief to some extent, of one or more of the symptoms associated with the specific infection; reduced morbidity and mortality, and improvement in quality of life issues.

General

The methods and compositions described herein relate to the identification, isolation, and characterization of genes which encode proteins useful for the biosynthesis of TG inhibitors such as moes, and their homologues. The methods and compositions also relate to the production of such proteins, and their use in the synthesis of moes, the production of modified moes, and the altered expression (e.g., overexpression) of moes. In one embodiment, the moe is moe A.

The methods and compositions also relate to the mutation, disruption, and expression of genes involved in the moe biosynthetic pathway. The present disclosure describes the isolation of gene clusters for moe biosynthesis from S. ghanaensis ATCC14672 (a.k.a., “moe biosynthesis-related genes”) as well as the insertional inactivation of certain moe biosynthetic genes as evidence of (i) cloning of moe biosynthesis gene clusters and (ii) the potential for generating bioactive moe derivatives through mutagenesis. The present disclosure also describes manipulations of regulatory genes to improve moe production in S. ghanaensis ATCC14672. The present disclosure also describes the heterologous expression of moe biosynthesis related genes in S. lividans TK24.

The moe biosynthesis-related genes of the present invention are useful for the chemoenzymatic generation of clinically valuable moe derivatives. For example, the moe biosynthesis-related genes of the invention are useful for the development of analogs suitable for use in humans. They are valuable tools for the chemoenzymatic synthesis of novel bioactive molecules and chemical probes. Additionally, manipulation of the moe biosynthesis-related genes of the present invention in cellular expression systems is useful for the generation of moe production and enrichment. In one embodiment, the moe biosynthesis-related genes of the invention are manipulated for overexpression of moes in a prokaryote.

The moe biosynthesis-related genes of the present invention may be selected from the group consisting of moeA4 moeB4, moeC4, moeB5, moe A5, moeD5, moeJ5, moeE5, moeF5, moeH5, moeK5, moeM5, moeN5, moeO5, moeX5, moeP5, moeR5, moeS5, moeGT1, moeGT2, moeGT3, moeGT4, and moeGT5. Exemplary polynucleotide sequences of moe biosynthesis-related genes include SEQ ID NOS: 3-25. Exemplary polypeptide sequences of moe biosynthesis-related genes include SEQ ID NOS: 26-48.

In one embodiment, the moe biosynthesis-related genes encode polypeptide fragments or variants of a natural moe biosynthesis-related polypeptide, wherein the variants comprise one or more conservative or non-conservative amino acid substitutions, additions or deletions compared to the natural moe biosynthesis-related polypeptide. A moe polypeptide (a.k.a., moe biosynthesis-related polypeptide”) in its native state is related to the biosynthesis of moes (e.g., moenomycin A), intermediates, derivatives or homologues, thereof. Variant polypeptides can be prepared by altering the sequence of polynucleotides that encode the natural moe biosynthesis-related polypeptide sequence. This is accomplished by methods of recombinant DNA technology well know to those skilled in the art. For example, site directed mutagenesis can be performed on recombinant polynucleotides encoding the moe biosynthesis-related polypeptide sequence to introduce changes in the polynucleotide sequence so that the altered polynucleotide encodes the peptides of the invention.

In some embodiments, the variants are at least about 85% identical, at least about 90% identical, or at least about 95% identical to the corresponding natural moe biosynthesis-related polypeptide. Typically, the variant moe biosynthesis-related polypeptides retain at least about 25%, at least about 50%, at least about 75%, at least about 80%, at least about 90%, or at least about 95% of the the biological activity of the natural polypeptide. The biological activity of the variant polypeptides and the natural polypeptide may be assayed according to the methods described herein, e.g. the type and/or quantity of moes or moe derivatives/intermediates produced by a heterologous host expressing the variant sequence may be compared to the type and/or quantity of moes or moe derivatives/intermediates produced by the same organism expressing the natural polypeptide.

As used herein the term “moe intermediate” or “moenomycin intermediate” encompasses moe-related biosynthesis precursor molecules and/or metabolites of moe biosynthesis. Alternatively, the variant polypeptides may possess a biological activity different from the natural polypeptide. For example, variations in the primary amino acid sequence may affect the binding of a moe biosynthesis-related polypeptide to one or more substrates, thereby providing moe derivatives different from those produced by the natural polypeptide. Assessing whether variant polypeptides have activity different from the natural polypeptide may also be performed according to the methods described herein. For instance, the variant polypeptide may be expressed in a heterologous host and the appearance of a new moes or moe derivatives may be observed using LC-MS analysis.

Although some portions of the discussion and examples focus on S. ghanaensis ATCC14672 or S. lividans TK24, it is understood that any bacterial strain or mammalian cell line which produces moe, moe-like compounds, or which includes homologues of the genes identified for moe biosynthesis, may also be used. Thus, using the teachings described herein, moe biosynthesis related genes may be expressed in heterologous systems including, but not limited to bacterial (e.g., Streptomyces sp., E. coli), mammalian (e.g., mouse, human, rat, hamster, etc., such as NIH-3T3, HeLa, HEK 293, etc.), yeast (e.g., Saccharomyces cerevisiae, Pichia pastoris) and insect cells (e.g., Drosophila melanogaster Schneider cells to generate moes and moe derivatives.

In preparing the recombinant expression constructs to express moe biosynthesis-related genes in heterologous hosts, the various polynucleotides of the present invention may be inserted or substituted into a bacterial plasmid-vector. Any convenient plasmid may be employed, which will be characterized by having a bacterial replication system, a marker which allows for selection in a bacterium and generally one or more unique, conveniently located cloning sites. Numerous plasmids, also referred to as vectors, are available for transformation. Suitable vectors include, but are not limited to, the following: viral vectors, such as lambda vector system gt11, Charon 4, and plasmid vectors such as pBR322, pBR325, pACYC177, pACYC1084, pUC8, pUC9, pUC18, pUC19, pLG339, pR290, pKC37, pKC101, SV40, pBluescript II SK+/−. or KS+/− (Stratagene, La Jolla, Calif.), and any derivatives thereof. Also suitable are yeast expression vectors, which may be highly useful for cloning and expression. Exemplary yeast plasmids include, without limitation, pPICZ, and pFLD. (Invitrogen, Carlsbad, Calif.). The selection of a vector will depend on the preferred transformation technique and target host cells.

The nucleic acid molecules encoding moe biosynthesis-related genes are inserted into a vector in the 5′ to 3′ direction, such that the open reading frame is properly oriented for the expression of the encoded protein under the control of a promoter of choice. In this way, the moe biosynthesis structural gene is said to be “operably linked” to the promoter. Single or multiple nucleic acids may be inserted into an appropriate vector in this way, each under the control of suitable promoters, to prepare a nucleic acid construct of the present invention.

Certain regulatory sequences may also be incorporated into the expression constructs of the present invention. These include non-transcribed regions of the vector, which interact with host cellular proteins to carry out transcription and translation. Such elements may vary in their strength and specificity. Depending on the vector system and host utilized, any number of suitable transcription and/or translation elements, including constitutive, inducible, and repressible promoters, as well as minimal 5′ promoter elements may be used.

A constitutive promoter is a promoter that directs constant expression of a gene in a cell. Examples of some constitutive promoters that are widely used for inducing expression of heterologous polynucleotides include the ADH1 promoter for expression in yeast, those derived from any of the several actin genes, which are known to be expressed in most eukaryotic cell types, and the ubiquitin promoter, which is the promoter of a gene product known to accumulate in many cell types. Examples of constitutive promoters for use in mammalian cells include the RSV promoter derived from Rous sarcoma virus, the CMV promoter derived from cytomegalovirus, β-actin and other actin promoters, and the EF1α promoter.

Also suitable as a promoter in the plasmids of the present invention is a promoter that allows for external control over the regulation of gene expression. One way to regulate the amount and the timing of gene expression is to use an inducible promoter. Unlike a constitutive promoter, an inducible promoter is not always optimally active. An inducible promoter is capable of directly or indirectly activating transcription of one or more DNA sequences or genes in response to an inducing agent (or inducer). Some inducible promoters are activated by physical means, such as the heat shock promoter (HSP), which is activated at certain temperatures. Other promoters are activated by a chemical means, for example, IPTG. Other examples of inducible promoters include the metallothionine promoter, which is activated by heavy metal ions, and hormone-responsive promoters, which are activated by treatment of certain hormones. In the absence of an inducer, the nucleic acid sequences or genes under the control of the inducible promoter will not be transcribed or will only be minimally transcribed. Promoters of the nucleic acid construct of the present invention may be either homologous (derived from the same species as the host cell) or heterologous (derived from a different species than the host cell).

Once the nucleic acid construct of the present invention has been prepared, it may be incorporated into a host cell. This is carried out by transforming or transfecting a host or cell with a plasmid construct of the present invention, using standard procedures known in the art, such as described by Sambrook et al., Molecular Cloning: A Laboratory Manual, Third Edition, Cold Spring Harbor: Cold Spring Harbor Laboratory Press, New York (2001). Suitable hosts and cells for the present invention include, without limitation, bacterial cells, virus, yeast cells, insect cells, plant cells, and mammalian cells, including human cells, as well as any other cell system that is suitable for producing a recombinant protein. Exemplary bacterial cells include, without limitation, E. coli and Mycobacterium sp. Exemplary yeast hosts include without limitation, Pichia pastoris, Saccharomyces cerevisiae, and Schizosaccharomyces pombe. Methods of transformation or transfection may result in transient or stable expression of the genes of interest contained in the plasmids. After transformation, the transformed host cells can be selected and expanded in suitable culture. Transformed cells are first identified using a selection marker simultaneously introduced into the host cells along with the nucleic acid construct of the present invention. Suitable markers include markers encoding for antibiotic resistance, such as resistance to kanamycin, gentamycin, ampicillin, hygromycin, streptomycin, spectinomycin, tetracycline, chloramphenicol, and the like. Any known antibiotic-resistance marker can be used to transform and select transformed host cells in accordance with the present invention. Cells or tissues are grown on a selection medium containing an antibiotic, whereby generally only those transformants expressing the antibiotic resistance marker continue to grow. Additionally, or in the alternative, reporter genes, including, but not limited to, β-galactosidase, β-glucuronidase, luciferase, green fluorescent protein (GFP) or enhanced green fluorescent protein (EGFP), may be used for selection of transformed cells. The selection marker employed will depend on the target species.

Expression is induced if the coding sequences is under the control of an inducible promoter. To isolate the protein, the host cell carrying an expression vector is propagated, homogenized, and the homogenate is centrifuged to remove bacterial debris. The supernatant is then subjected to sequential ammonium sulfate precipitation. The fraction containing the protein of the present invention is subjected to gel filtration in an appropriately sized dextran or polyacrylamide column to separate the proteins. If necessary, the protein fraction may be further purified by HPLC. Alternative methods of protein purification may be used as suitable. See J. E. Coligan et al., eds., Current Protocols in Protein Science (John Wiley & Sons, 2003). Upon obtaining the substantially purified recombinant protein, the protein may be administered to a subject as described herein.

Unless defined otherwise, all technical and scientific terms used herein generally have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. As used in this specification and the appended claims, the singular forms “a”, “an” and “the” include plural referents unless the content clearly dictates otherwise. For example, reference to “a cell” includes a combination of two or more cells, and the like. Generally, the nomenclature used herein and the laboratory procedures in cell culture, molecular genetics, organic chemistry, analytical chemistry and nucleic acid chemistry and hybridization described below are those well known and commonly employed in the art. Standard techniques are used for nucleic acid and peptide synthesis. Standard techniques, or modifications thereof are used for chemical syntheses and chemical analyses.

It will be readily apparent to one skilled in the art that varying substitutions and modifications may be made to the invention disclosed herein without departing from the scope and spirit of the invention. The invention illustratively described herein suitably may be practiced in the absence of any element or elements, limitation or limitations which is not specifically disclosed herein. The terms and expressions which have been employed are used as terms of description and not of limitation, and there is no intention that in the use of such terms and expressions of excluding any equivalents of the features shown and described or portions thereof, but it is recognized that various modifications are possible within the scope of the invention. Thus, it should be understood that although the present invention has been illustrated by specific embodiments and optional features, modification and/or variation of the concepts herein disclosed may be resorted to by those skilled in the art, and that such modifications and variations are considered to be within the scope of this invention.

In addition, where features or aspects of the invention are described in terms of Markush groups or other grouping of alternatives, those skilled in the art will recognize that the invention is also thereby described in terms of any individual member or subgroup of members of the Markush group or other group.

Also, unless indicated to the contrary, where various numerical values are provided for embodiments, additional embodiments are described by taking any 2 different values as the endpoints of a range. Such ranges are also within the scope of the described invention.

All references, patents, and/or applications cited in the specification are incorporated by reference in their entireties, including any tables and FIGS., to the same extent as if each reference had been incorporated by reference in its entirety individually.

Moe Biosynthesis-Related Genes of the Invention

The candidate S. ghanaensis moe biosynthetic genes were identified by in silico scanning of shotgun sequenced fragments of the S. ghanaensis genome. As a result of genome scanning, 4 neighboring contigs were identified. Two of the identified genes, moeD5 and moeGT3, were used as a hybridization probes to retrieve additional overlapping cosmids from a genomic library of S. ghanaensis that cover one of the moe gene clusters. Physical mapping and partial sequencing of the cosmids confirmed that the in silico assembly of contigs coincided with their localization on the chromosome. A detailed discussion of the methods and procedures outlined above is provided in the Experimental Examples, Sections II and III.

Two gene clusters (e.g., gene cluster 1 and gene cluster 2) were identified which contain the moe biosynthesis-related genes of the invention. Sequence data indicated at least 1.3 Mbp separate gene cluster 1 from cluster 2. The polynucleotide sequence of gene cluster 1 (SEQ ID NO: 1) is shown in Table 2. The polynucleotide sequence of gene cluster 2 (SEQ ID NO: 2) is shown in Table 3.

TABLE 2 Complete DNA sequence of S. ghanaensis ATCC14672 moe (moe) biosynthesis gene cluster 1. CGCGCCCCTCCGGAGGCTGTCCGGAAGGGCGCGGCACGGGTGGGGGCGGGTCGGCCGGGTTCCGCCGGCCGGGTCGGGTC AGCCGACGACGGCCGGCTGTGCGGGCGCCTCCCGCGGGCCGCCGCTGATCGGCAGGACCTCGGCGTCCGAGGTCCGCACG AAGCGGGGACCGTCCGGCCCGTACACCCCGCGCGGCTCCGGGAACAGGGTGAGCAGGACGAGGTAGAGCACCGCCGCCAG GGCCAGGCCGACCGGCAGCGAGACGTCCATGCCGTCGGCCAGGTCGCCGAGCGGTCCGACGAACTGACCGGGCAGGTTGG TGAAGAGCAGGGCCACGGCGGCGGAGACCAGCCAGGTGGCCAGGCCGCGCCAGTTCCACCCGTGGGCGAACCAGTAGCGG CCGCCGGTGCGGCGCTGGTTGAAGACCTGGAGCGCCTCCGGGTCGTACCAGCCCCGGCGGGTGACGTAGCCGAGGATCAT CACGACCATCCACGGCGCGGTGCAGGTGATGATCAGGGTGGCGAAGGTGGAGATCGACTGCGTGAGGTTCAGCCAGAAGC GGCCGGCGAAGATGAACACGATGGACAGCACGCCGATGAAGATCGTCGCCTGAACGCGGCTGAAGCGGGTGAAGACGCTG CAGAAGTCCAGTCCGGTGCCGTACAGCGCGGTCGTGCCGGTGGACAGGCCGCCGATCAGCGCGATGAGGCAGAGCGGCAG GAAGTACCAGCCGGGCGCGATCGCGAGCAGGCCGCCGACGTAGTTGGGCGCGTCGGGGTCGAGGTACGCCCCGGCCCGGG TGGCGATGATCGACGCGGTGGCCAGGCCGAAGACGAACGGCAGCAGCGTGGCGATCTGGGCCAGGAACGCCGCGCCCATC ACCCGGCGGCGGGGGGTCGCGGCCGGGATGTAGCGGGACCAGTCGCCGAGGAACGCGCCGAAGGAGACCGGGTTGGAGAG CACGATCAGTGCGGACCCGATGAAGGACGGCCAGAAGAGCGGGTCCGCGGTCGAGGCGAAGGTGCCCGCGTAACCGGGGT CGAAGTCGCCGGCGAAGGCGAAGGCGCCCAGCACGAACAGGGCCGAGGCCGCCACCACGGCGATCTTGTTGACCAGCAGC ATGAAGCGGAAGCCGTAGATGCAGACCACCAGCACGAGACCGGCGAAGACCGCGTAGGCCAGGGCGTACGTCACCGTCGA CTCGGGGACTCCCAGCAACCGGTGCGCGCCGCCGACCAGGGCGTCCCCCGAGGACCACACCGAGATCGAGAAGAAGGCGA TCGCGGTGAGCAGCGCGAGGAAGGAGCCGACGACCCTTCCGTGCACCCCCAGGTGGGCGGAGGAGGAGACGGAGTTGCTG GTGCCGTTCGTGGGCCCGAAGAGCGCCATGGGCGCGAGCAGCAGCGCGCCCGCGACCAGGCCGAGCAGCGTCGCCGCGAG CCCCTGCCAGAAGGAGAGGCCGAACAGGATGGGGAAGGTGCCCAGCACACAGGTGGCGAAGGTGTTGGCGCCGCCGAAGG CGAGGCGGAACAGATCGAGCGGACGGGCCGTGCGGTCCTCGTCCGGAATCTGCTCGACACCGTATGTCTCGATGTCGGTG ACCGCGGTCTTCACGGGATCTCCTCCTTCTGTTCACGCCCCGGGGGATGGCCCCACAGTCTGAATCCCCCCACTGACCTG CGACAACTGTGTCAATCACAGAGAGGTAGCCTGCTTTATGTGGCCACCAACAAACTGACCGTCGAGGATCTGCTCTCCTT CCCCGCCCTCCAGCTGACGCTGCGGGCGGGGAAGAGTGGACTCTCACGCTCCGTTTCCTGGGCCCACACCAGCGAGTTGG CCGATCCGACCCCCTGGCTGCTGGGGGCCGAGGTGATCATGACGACGGGGCTCGCGATCCCCCGCACCGCGACCGGGCAG CGCCGCTATCTGGAGCGGCTGGACGACGCCGGGGTCTCCGCGCTGGCCCTGTCGGCGCAGCTGCACATGCCGCCGCTGCA CGACGCGTTCTTCAAGGCGGCCGAGGAACGGGGCTTCCCCGTCCTGGAGGTGCCGCTCGCCGTTCCGTTCATCGCGGTCT CCCAGGAGGTCGCCGCCGCGGTGCAGGAGGACGCCCGGCACCGGCTGGGCGCGCAGCTGCAGGTCTTCGGCTCGCTGCGC TGGATGGTCGCCGAGGACCTCGACACCCCGACCCTCCTGCGCCGCCTCGAGCGCCTGTCCGGGTACAACGTCTTCCTCTG CACCCCGCAGGGCCGCCCGCTGCTGCCCGGGGTGCCCACCCCCGACCCGGGCGTGCTGCCCGCCTCGGTGGACGCCCCGC CGACCGTCCCCGGCGGTTTCGTCCTGCCCGTGCCGGCACCGGGCGGTCCGGCCGGTTTCCTGGTGGCGTACGAGAGGCAG GGCGCCCAGCCCGCCGGGCTCGCGGTCGTCCAGCACATCGCCACGGTGGCGGCGCTGCGGCTGGCGATGGTGCGCAACGA ACGCGAGACGCTGCGCCGCGAGGGCGCCGAGACCCTCGCCGAACTGCTCCGGGAGGTGCTCGACCCGGACGCCGCCCGCC GCCGGCTCGCCCGGCACGCGATCGAGGGCGAGACCGTGCTGCTCGTGGTCCGGAACACCACCGACGAGGCACTGCTGCAC TGCCTGGAGGACCGCCCCCACCTGCTGCTCACCCGGGGCGACGACCGGTACGTGCTCGGGGCCCCGGAGCTGGCCCCGGC GATCGGCGAACTGCCCGGGGTGGCGGCCGGGATGAGCCGCGCCCTTCCGCCGGGCGCGGCCCTGAAGGTCGCCGAGCGCG AGGCCCTGTGGGCGCTGAGCAAGGCGGTCGAGTCGGGCCGCCCCCTGGTCCGCTACGGCGACGACGCGACGGGCCGCTGG CTGCCGGAGGACCCCGCGGTGCTGAGCGCGCTGGTCGAGCACGTCCTCGGCGAGGTGCTGCGCTACGACCTGGCCCACGG CTCCCAGCTCCTGGTCTCCGTGCGCACCTGGCTGGAGCGCGACCGCCGTACGGAGACCGCCGCGGCCGCCCTCCACATCC ACCCCAACACGCTCGCCTACCGGCTGCGCCGCTTCGGCGCCCTCTCCCGGCGCGACCTGTCCTCGACCGGCGCGCTGGCG GAGGTCTGGCTGGCGATCCAGGCGGCCGGGACGCTGGGGCTCACCGACTGAGCGCGCCGGACACCGGCCCCGGGCGGGGA CACGGACCGGGCGGCGCGCACCGCCTCCGAGGCGTTCGTCCGGGCCGGGCCGCCCCCGCCGGCGGGGAACCGGCCGGGGC CGTCACCCCTCGGCCGGGCCGCCCCCGCCTCGTCGGACGACCGGAGCCGGACGGCCGGCTCCCCCTTGAGGACCTTGCCG CCGGGTCCCAGGGGAAGCCGGTCGGGGAGGACCACCCGCCGCGGACGCCGGTACGCCGCGATGTGCCGCTCACCCCACGC CACGACGGAATCCGCCAGTGCCCCGTCCGGTGTCGGGCCGTCCCGTGGCACCACCACCGCGCACACCTCCTGGCCGTACA CCGGGTCGGGGAGCCCCACCACCGCGACCCGGGCGACCGCCGGATGGCGGAGCAGCGCTTCCTCGACCTCACGGGGATGG ACGTCGTACCCGCCCCGCAGGATCACGTCCTTCTCGCGGTCGACGGCGGTGGGGTACCCCTCCTCGTCCAGCAGCCCTAG ATCGCCGGTGCGGAACCAGCCGTCCACGAACGCGGCGGCCGTGGCGCGGGGGGCGTCGACGTACCCGGCCATCAGGTTGT GCCCGCGGACGACGATCTCGCCCACCTCGCCGGAGGGAGCGGCCCGACGGCGTCCTCCGCCTCGGACCGGGGCTGTCGCG ACCTCCACGCCCCAGAGGTGATCCGCTCGGCGCCCGCGACCACCGCGGTGCGCTCCGGCCACCGGCCCGCGGAGTCCGAG AGGATCGTGGCCGCCGAGAGGGTCGCCCCCGGGCCCGCCCACGTGTTCCCGCCGTCGGTGACGGCTCCGGAATCCGTCGT GAACGGCCCTGCCGGAGGTTTTCGTACGCGCTCGTCGCGACTCCGCCTCGCTTGCCGGTAATCGGCTTCCATCGGCCGGA CGACAGCATGAGACGTCTTCTGTGCAAGACCCGCGGTGGATCCCAGGATGAGACCGGCCCGAAGGGTAGCGAAAGGAGCG GACCTTGGACATCTCCTCGTCCATGGACTTCTTCGTGCGACTCGCCCGCGAAACCGGTGACCGGAAGAGGGAGTTTCTCG AACTCGGCCGCAAGGCGGGTCGGTTCCCCGCGGCGAGCACCTCGAATGGCGAGATTTCCATCTGGTGCAGCAACGACTAC CTGGGTATGGGGCAGCACCCGGACGTCCTCGACGCCATGAAGCGCTCCGTGGACGAATACGGCGGAGGATCCGGGGGTTC GCGGAACACAGGCGGAACCAACCACTTCCATGTGGCTCTGGAGCGGGAGCCGGCCGAGCCGCACGGAAAGGAGGACGCCG TTCTCTTCACCTCGGGGTATTCCGCCAATGAGGGATCCCTGTCGGTTCTGGCCGGGGCCGTCGACGACTGCCAGGTCTTC TCGGATTCGGCGAACCACGCGTCCATCATCGACGGTTTACGGCACAGCGGCGCCCGCAAGCACGTATTCCGGCACAAGGA CGGGCGGCATCTGGAGGAGTTGCTGGCCGCGGCCGACCGGGACAAGCCGAAGTTCATCGCCCTGGAGTCCGTGCATTCGA TGCGGGGCGACATCGCGCTCCTGGCCGAGATCGCCGGCCTGGCCAAGCGGTACGGAGCGGTCACCTTCCTCGACGAGGTG CACGCGGTCGGCATGTACGGCCCGGGCGGAGCGGGCATCGCGGCCCGGGACGGCGTGCACTGCGAGTTCACGGTGGTGAT GGGGACCCTCGCCAAGGCCTTCGGCATGACCGGCGGCTACGTGGCGGGACCGGCCGTGCTCATGGACGCGGTGCGCGCCC GGGCCCGTTCCTTCGTCTTCACCACGGCGCTGCCGCCGGCGGTCGCGGCGGGCGCGCTCGCCGCGGTGCGGCACCTGCGC GGCTCGGACGAGGAGCGGCGGCGGCCGGCGGAGAACGCGCGGCTGACGCACGGCCTGCTCCGCGAGCGGGACATCCCCGT GCTGTCGGACCGGTCCCCCATCGTCCCGGTGCTGGTCGGCGAGGACCGGATGTGCAAGCGCATGTCGGCCCTGCCGCTGG AGCGGCACGGCGCGTACGTCCAGGCCATCGACGCGCCCAGCGTCCCGGCCGGCGAGGAGATCCTGCGGATCGCGCCCTCG GCGGTGCACGAGACCGAGGAGATCCACCGGTTCGTGGACGCCCTGGACGGCATCTGGTCCGAACTGGGGGCCGCCCGGCG CGTCTGACGCCCCGCAGTGTCACCCCGCGGGAGGGCTCTGCGGAGCGGGCCCGGCGTCCCCGCCCCCCGGACCCGCACCC GTCCAGATCCGGCCCATCTCGGCGGAGACCGCCATGACCTCCTCGAAGGTGCCCGAGGCCTCCACCCGCCCGCCCTCGAG CACCACCACGCGGTCGGCCGCGCGCAGCAGAGCGGGCCGGTGGGAGACCGCGAGCACGGTCCGCGTCCCGTCCAGCAGCC TCTCCCACAGCAGGTGCTCGGTCTCCGGGTCCAGGGCACTGGAGACGTCGTCCAGCACCACGAGTTCGGGGTCGCCGACC AGCATGCGGGCGATCGCGACCCGCTGGATCTGCCCGCCCGAGAGGCGCAGGCCCCGCGGGCCCACCACGGTGTCCGGGCC GTCCTGCATCGCCGCCAGGTCGGGCTCCGCCACGGCGAGGCGCACGGCCTCGTCGAAGGCCGCGCCGTCCCGGCCCAGCA GGACGTTCTCCCGCACCGTCCCGCTGAACAGACACGGGACCTGCGGGGTGTACCCGCAGCGCGGCGCCACCAGGAACGAC GCGGGGTCGGCGATCGGTTCGCCGTTCCACAGCACGGTGCCCCGCTCGTGCGGGAGCAGTCCGAGGACGGCCCGGACCAG GGTGCTCTTGCCGGAACCGACCCGGCCGGTGACCACGGTGACGGTGTGCCGCTCCACCACCAGGTCCACGTCCTCTATGC CGTGCCCCGCCCCGGGGTGGCGGGCCGTCAGCCCGCGCACGGCCAGTTCCCGCAGGGGCGGGGCGGGCTCCGGCCCGGCG TCCGGGGCGGCGGCCCCCTCGCCGGTCCCTCCCGGCGCGTCGGACGCGATCGGCGGACTGGCCCGCTCCAGGGACCGCCG CAGCCGGCAGCCGAGGTTGTTGGTGATCCGGCCGAGCGCCACCGAGACCCGCTGCAACCGCACGGACAGCATGCCGATCG ACCCCAGGGCCTCGGTCAGGATCTGCAGGTAGAAGGCGAACAGGGCGAGATCGCCGACGCTGAAGGTCCCCTCGTCCATC CGCCCCGCGACCAGCAGCAGCACCACGCCGACCCCGATCGGGGCCGGGTTGCCGATCACCGTGCGCTGGACGACGGCGTA CAGCTCCTCCCGCACCGCGGCTTCGGCACGGGCGCCGTTCAGCCCGGCGACGTGCGCGGCGACCTGCGGCTCGGCGGCGG CGGCCTGCACCGCGCCCACCGCGCCCACCATCTCCCGCAGGGCTCCCGCCACCTCCCCGGACGCGGCCCGGGTGGCCCGC CGGTGCCGCAGGAACCGGCTGTGGGCCAGCGCGGTGACCAGCGTCAGCAGGACGAGGAGGGCGAGGAGGGCGCCGGTGAC CACGGCGTCGATCCGCATCATCACCGTGACCGACGCGGCGACGAACAGCCAGTGGGCGAGGTTCGTCGGCGCCCAGGCGA CGAAGAACCCCGTCTCGTCGACGTCCTCGCCCACCGTTCGCAGGGACTCGCCGGGGCTGGTGCGGGCCGTCACCTCCGAC CCGCGCAGGGCCGATCCCAGCAGGGCGTGCCGCAGCCGCGCCGTGGTGCCGTACTGGACCCGCGGCTCCAGCCTGTTGAT CATCACGCCGAACTGGAGGAACAGCCGTCCCGCCTCGATCGCGGCCACCAACGCGATGATCAGCCACACGCCCCCGCCCG CGCCCAGCGCGTCGAACAGCCGCTGGAACAGCAGGCCCACCACCAGGGTTCCCGCCCGCAGCAGGACCCACAGACCGGTG AGGGTCCAGTAGGTGCGGGCCGAGCCGCGCAGCACGTCCGCGAGCCGTCCCAGGACGGACTGCCGCCCCGCTCCGTCGTC CGCCCGCCTCGCTCTCCCTCCTGCCCGCTCCGCTCCGTCGTCCGCCCGCCTCGCTCTCCCCCCTGCCCGCTCCGCTCCGT CGTCCGCCCGCTCCACTCCTGGCACCCCGCTCCCCCTCCCGTCCCCGTTCCTCACCGGGTGGCTCCGGCCGTGCGGAGGA GTGCGTGGAAGCGCGACCCGGGATCGGCGGCGAGGACCCTCCGCTCCCCCTCCTCGGCGACCTTCCCCTCCTCCAGCACC AGGATCCGGTCGACGTTCCGGAGCAGGTGCGGGCGGTGCTCCACCACGACGGCGGTGCGGCCCTCGAGCAGCCGCTCCAG CGCGGGCATCAGGAGCCGCTCGCTGTACGGATCCAGCCGGGCCGTCGGCTCGTCCATCAGGACCAGCCCCGGATCGCGCA GGAACACCCTGGCCAGCGCGAGCTGCTGCTCCTCGCCCGCGGACATGCCGCGGGCCCCGGCGCCGAGCGGCGTGTCCAGA CCGTCGGGCAGGGTGCGCAGCCAGGGGCCGAGCCCGGCCTCGCCGAGAGCGGCGCGCAGCCGGTCGTCGGGGACGGAGCG GTCGAAGAAGGTGAGGTTGTCCCGCAGCGAGGCGTGGAAGACGTGCACCTCCTGGGTGACCAGCGCGACCCGGCTGCGCA GCGCCCGGGGATCGATCTCCGTCAGGTCCAGGCCGCCTGCCGAGACCGAGCCCGCCCCCGGGTGGTGGAGCCCGAACAGC AGCCGGACCACGGTGGACTTGCCGCTGCCGGTGCGTCCCACGACGCCGAGGCGTTCGCCGGGGCGCAGGGTGAAGGAGAC GTCCCGCAGCACCGGCTCGTCGGGCTCGTAGCCGAAGGAGACCCCGTCGAAGCGGACTCCGGGCAGTCCGGCCGGCAGCG TCCCGCGTCCCGTGCGGGGCGCCGCCGTCCCGTGGCCCAGCAGGTCCCTCAGCCGCTGGGCGCTCGCGGCGGCGTCCTCG AGTTCGCGGAAGCGGGTGGTGACCGCGAGCAGGGGGCGGCGCAGCAGCATCGCGTAGGACAGGGAGGCGAAGGCCGTCCC CGTGGAGAGCTGTCCGCGGGCGTGCAGCCAGGCGCTGACCGCCAGGGCCAGGACGACGCTGACGGCGGACAGGCCCTGCA CCGTGGCGGGCCAGCGGACCGAGGCCCGCGCCGCGTCGCGGGCCTTCCGGTACAGGTCGTCCTGCCGGTCGCCGAGTTCC CGCAGGGTGTACCGCGAGGCCCCGTTGACGCGGAGGTCCTCCGCCGCCGCGAGGCGCTCCTCGAGGAAGCCCTGCAGGTC CGCCGCGACCCGCTGCCGCGCGGTGACGAAGGGCATGGCGCGGCCCACCAGGGTCCGCAGCAGCAGGAGGGTGCCTGCCG CGAACGGGGCGACCACCAGGGCCAGCCGCCATTCCAGCCGGAACAGGGCGACGAGGATGCCGACGATCAGCAGTGCCTGC GCCAGCAGTTCCAGCAGCAGCGTCGACATCACCGCGGCGAGCCGGGTGACGTCGCCGTCCATCCGCTCGACGAGTTCGCC GGGCGGATGCTTGCGGTAGAAGCCCGGCGGCCGGCTCAGGCAGTGCTCGACCAGGTCCGCGCGCAACCGGTTGGTGCTGC GCCAGGCGACCCGTGAGGACAGCGCCTCGGTGCCCGCGGTGACCACGAGCGTCCCGACGGCGGCCGCCAGGGACCAGGCG GCGAGGTCCAGCAGCGTCTTCCGGGAGTCGCCGGAGAGCGCCCCGTCGATGAATCCGCGCAGCAGGTAGGGCGCCACCAG CTGGAGCCCCATCCCCGCGGGGACCAGGAGGGCGAGCAGCGCCACGGCGGTCCGTTCACCGCGCAGGTAGCGGACAAACG TGGAGAGATGCCGCAACGGACTGTCTGCCAACGCGCCCCTCCCCCGTTCGCCCGGCGGCGAGCGGCCAGCATAAAGTCCT GTGCGCCTCCTTGTGAATGACGCCTCGTCAACGGCGGCCGGAGCACGCCCTTTCTGCGGGAATGCCGATAGCGGACGCCG CTCCGGGAGGGGGCGAAGCACACCATTGCTCGTGATTGACGCATGCTGTTAGACTCCCCACGTCTCTTGGTCCGGACATG CGTTTCTCAACGCCGAAAGCCTGGTCAACCGCACTTTCGGCACCGCACAGTCCCACGGCGTCCGAGCGGTCGCGCGAGTC GGCCCGGTCGAGCCAGAGGCAGCCACACGAACGTGCACCGCAATGCACCGCCTTGATCAGCCAGTTGTGAGCGAAACAAG GGGGATTCGTGTCGAGCGATACACACGGAACGGACTTAGCGGACGGCGACGTTTTGGTCACCGGTGCGGCCGGCTTCATC GGGTCGCACCTGGTGACGGAACTGAGGAATTCCGGCAGAAACGTTGTGGCGGTGGACCGGAGACCCCTTCCGGACGACTT GGAGAGTACGTCCCCGCCCTTTACCGGTTCGCTCCGGGAGATACGCGGTGACCTCAACTCATTGAATCTGGTGGACTGCC TGAAAAACATCTCGACGGTCTTCCACTTGGCCGCGTTACCCGGAGTCCGCCCGTCCTGGACCCAATTCCCCGAGTACCTC CGGTGCAATGTACTGGCGACCCAGCGCCTGATGGAGGCCTGTGTGCAGGCCGGCGTGGAACGCGTGGTGGTCGCCTCGTC CTCCAGCGTCTACGGCGGCGCGGACGGCGTGATGAGCGAGGACGACCTGCCCCGTCCGCTCTCCCCCTACGGGGTCACCA AACTCGCCGCGGAGCGGCTGGCCCTGGCCTTCGCGGCCCGCGGCGACGCCGAGCTCTCGGTCGGCGCCCTGAGGTTCTTC ACCGTCTACGGCCCCGGCCAGCGCCCGGACATGTTCATCTCCCGGCTGATCCGGGCGACGCTCCGGGGCGAACCCGTCGA GATCTACGGCGACGGGACCCAGCTCCGCGACTTCACCCATGTGTCCGACGTGGTGCGGGCGCTGATGCTGACCGCGTCGG TGCGGGACCGGGGCAGCGCGGTGCTGAACATCGGCACCGGGAGCGCCGTCTCGGTCAACGAAGTGGTCTCCATGACCGCG GAGCTGACCGGTCTGCGCCCGTGCACCGCGTACGGTTCCGCCCGCATCGGCGACGTCCGCTCGACCACCGCCGACGTGCG GCAGGCCCAGAGCGTCCTGGGCTTCACGGCCCGGACGGGTCTGCGGGAAGGTCTCGCCACCCAGATCGAGTGGACCCGGC GGTCACTGTCCGGCGCCGAGCAGGACACCGTCCCGGTCGGCGGCTCCTCGGTGTCCGTGCCGCGGCTGTAGGCGGCATGT GCGGCTTCGTCGGATTCAGTGACGCCGGCGCCGGGCAGGAGGACGCCCGTGTCACGGCCGAGCGCATGCTCGCCGCCGTG GCGCACCGCGGCCCCGACGGCTCGGACTGGTGCCACCACCGGGGCGTCACCCTCGCGCACTGCGCCCTGACCTTCACCGA TCCGGACCACGGCGCGCAGCCGTTCGTCTCCGCGTCGGGAGCCACCGCCGTGGTGTTCAACGGCGAGCTCTACAACCACG CCGTGCTGGGCGACGGGGCGTTGCCCTGCGCACCCGGAGGCGACACAGAAGTTCCTGGTGGAACTCTACGAGTTGCTGGG CATGCGGATGCTCGACCGGCTGCGGGGCATGTTCGCCTTCGCGCTGCAGGACGCCCGCACCGGCACCACGGTGCTGGCCG CGACCGATGGGGAAGAGCCCCTCTACTAACACCCGCGTGCGAGACGGACATCGCTTTCGCGTCGGAACTCACGTCTCTGC TGCGGCACCCCGCCGCGCCGCGCACACCGGAGGTGCGGGCGCTCGCCGACTACCTGGTGCTCCAGGCGTTCTGCGCCCCC GCCTCGGCCGTGTCGGGGGTGTGCAAGGTGCGCCCCGGCAGCTACGTGACCCACCGGCACGGCGCGTTGGACGAGACCGA GTTCTGGCGGCCCCGCCTGACCCCCGACCGGGGGGCGGGCCGCGGCCCCGGACGGCGGGAGGCCGCGCGGCGGTTCGAGG AGCTCTTCCGCGCCGCGGTCGCCCGCCGGATGACCAGCACCGACCGCCGCCTCGGCGTACTGCTCAGCGGCGGCCTGGAC TCCAGCGCGGTCGCCGCGGTGGCCCAGCAGCTCCTGCCGGGACGGCCGGTGCCCACCTTCAGCGCGGGGTTCGCGGACCC GGACTTCGACGAGAGCGACCACGCACGGGCGGTGGCGCGCCACCTCGGCACCGAGCACCATGTGGTGCGGATCGGCGGGG CCGACCTCGCCGGTGTGGTGGAGTCCGAACTCGCCGTGGCCGACGAGCCGTTGGCCGATCCCTCCCTGCTGCCCACACGT CTGGTCTGCCGGGCGGCGCGCGAGCACGTCCGCGGCGTGCTCACCGGTGACGGCGCGGACGAACTGCTCCTGGGCTACCG CTACTTCCAGGCCGAGCGGGCGATCGAGCTGCTGCTGCGCGTGCTGCCGGCCCCCCGGCTGGAGGCCCTCGTCCGGCTGC TGGTGCGCCGGCTGCCGGCCCGTTCCGGCAACCTCCCCGTGACCCACGCCCTCGGTCTGCTGGCCAAGGGCCTGCGCGCG GCACCGGAGCACCGGTTCTACCTCTCGACGGCGCCCTTCGGCCCGGGCGAGCTGCCACGGCTGCTCACCCCCGAGGCCGG GGCCGAACTGACCGGGCACGACCCGTTCACCGAGGTGTCGCGCCTCCTGCGGGGACAGCCGGGCCTGACCGGTGTCCAGC GCAGCCAGCTCGCCGTGGTGACCCACTTCCTGCGGGACGTGATCCTCACCAAGACGGACCGGGGCGGCATGCGCAGCTCC CTCGAGCTGCGTTCCCCCTTTCTCGACCTGGACCTGGTCGAGTAGGGCAACTCCCTGCCCACCGGCCTGAAGCTGCACCG GTTCACCGGCAAGTACCTGCTGCGGCAGGTCGCCGCCGGCTGGCTGCCCCCTTCCGTCGTCCAGCGGACGAAGCTGGGTT TCCGCGCGCCGGTGGCGGCCCTGCTCCGCGGCGAGCTGCGGCCCCTGCTCCTGGACACCCTCTCCCCGTCGTCCCTGCGC CGCGGCGGCCTGTTCGACACCGGGGCGGTGCGCCTGCTGATCGACGACCACCTCGGCGGCCGGCGCGACACCTCCCGCAA GCTGTGGGCGCTGCTGGTCTACCAGCTCTGGTTCGAGAGCCTGACGGCCGGACCCCGCGCCCTCGAGTCCCCCGCGTACC CGGCCCTCTCCTAGGAGACCCATGGCTGCCCCCGACCGACCGCTCGTCCAGGTGCTCTCCCCCCGGACCTGGGGCGAGTT CGGCAACTACCTCGCCGCGACGCGCTTCTCCCGCGCGCTCCGGAGCGTGATCGACGCGGAAGTGACCCTGCTGGAGGCGG AGCCGATCCTCCCGTGGATCGGCGAGGCCGGGGCGCAGATCCGGACCATCTCCCTGGAGAGCCCCGACGCCGTCGTCCGC AACCAGCGGTACATGGCCCTCATGGACCGCCTCCAGGCACGCTTCCCGGAGGGGTTCGAGGCGGACCCCACCGCCGCCCA GCGGGCGGACCTGGAACCGCTCACCCGGCACCTGCGGGAGAGCGCCCCCGACGTGGTGGTCGGCACGAAGGGGTTCGTGG CGAGGCTGTGCGTGGCCGCCGTCCGGCTCGCCGGGACGTCCACCAGGGTCGTCAGCCACGTGACCAACCCCGGGCTGCTG CAGCTGCCGCTGCACCGCAGCCGGTACCCGGACCTGACACTCGTCGGCTTCCCCCGGGCGAAGGAGCACCTGCTGGCCAC GGCCGGCGGCGACCCGGAGCGCGTCCAGGTGGTGGGCCCGCTCGTCGCCCAGCACGACCTGCGGGACTTCATGACCAGTG AGACGGCCGTCTCCGAGGCGGGGCCCTGGGGCGGCGACTCGGGCCCGGACCGGCCACGGGTGATCATCTTCTCCAACCGC GGCGGGGACACCTACCCCGAGCTGGTGCGGCGCCTCGCCGACCGCCACCCCGGCATCGACCTCGTCTTCGTCGGCTACGG CGACCCGGAGCTCGCCCGCCGCACCGCTGCGGTCGGGCGGCCCCACTGGCGGTTCCACAGCGTCCTCGGCCAGAGCGAGT ACTTCGACTACATCCGGCGTGCCTCCCGGTCCAGGTACGGGCTCCTCGTCTCGAAGGCGGGGCCCAACACCACCCTGGAG GCGGCCTACTTCGGCATACCGGTCCTGATGCTCGAGTCGGGGCTGCCCATGGAGCGGTGGGTGCCGGGACTGATCCACGA GGAGGGGCTGGGCCACGCCTGCGCCACCCCCGAGGAGCTGTTCCGCACGGCGGACGACTGGCTGACCCGCCCGTCGGTGA TCGAGGTGCACAAGAAGGCCGCGGTCTCCTTCGCCGCTTCCGTACTGGACCAGGACGCGGTGACGGCCAGGATCAAGGCC GCCCTCCAGCCCCTGCTGGACGCCCGATGACGGTCCGCCGCCCGGCCGCGTCCGCCCCCCGCGTCCTCCTGACCGCGGGC CCCGACGGGGTGCGCGTGGAGGGCGACGGGGAGGCGCGCCTCGGGCACCCCCTCACCGGTGACCACCTGGACCCGGGCCC GCCGGCCGAAGGCGTCTTCGCCGGGTGGAGGTGGGACGGCGAGCGCCTGGTGGCCCGCAACGACCGCTACGGCGTCTGCC CCCTCTTCTACCGGGCCGGCGGCGGCTCACTCGCGCTCTCCCCCGACCCGCTCGCCCTGCTGCCGGAGGACGGGCCCGTC GAGCTGGACCACGACGCGCTCGCCGTCTTCCTGCGGACGGGGTTCTTCCTCGCCGAGGACACGGCCTTCGCACAGGTCCG CGCACTGCCCCCGGCGGCCACGCTCACCTGGGACACCGGCGGGCTGCGGCTGCGGTCCGACGGGCCGCCGCGCCCCGGGG CCGCCGCGATGACCGAGGCGCAGGCGGTCGACGGCTTCGTCGACCTGTTCCGCGCCTCGGTGGCCCGCCGGCTGCCCGGC GAACCGTACGACCTGCCGCTCAGCGGCGGCCGGGACTCGCGGCACATCCTGCTCGAGCTGTGCCGCCGCGGCGCACCGCC GCGGCGGTGCGTCAGCGGCGCCAAGTTCCCTCCCGACCCGGGGGCCGACGCGCGCGTGGCGGCCGCCCTGGCGGGCCGGC TCGGTCTGCCGCACACGGTGGTGCCGCGCCCCCGTTCGCAGTTCCGCGCGGAGCTCGCCGCCCTGCCGGCCCAGGGCATG ACCACCCTGGACGGCGCGTGGACCCAGCCGGTCCTGGCCCACCTGCGCCGCCACAGCCGCATCTCGTACGACGGTCTCGG CGGCGGGGAGCTCGTCCAGAACCCGAGCGTGGAGTTCATCCGGGCCAACCCCTACGACCCCGCGGACCTGCCCGGCCTGG CGGACCGGTTGCTGGCCGCGAGCCGGACCGGCCCCCACGTGGAGCACCTGCTGAGCCCCCGGACGAACGCCCTGTGGAGC AGGCAGGCGGCGCGGCGGCGCCTCGTCACCGAGCTGGCCCGGCACGCCGACAGCGCCAGCCCGCTCAGTTCCTTCTTCTT CTGGAACCGGACCCGGCGCTCCATCTCCGCGGCTCCGTTCGCCCTGGGGGACGGACGGGTCCTGACGCACACCCCCTACC TCGACCACGCCCTCTTCGACCACCTCGCCTCGGTGCCGCACCGCTTCCTGGTCGACGGGACGTTCCACGACCGGGCGCTG CACCGGGCCTTCCCCGAGCACGCGGACCTGGGGTTCGCCTCGTCGGTGCCCCAGCGGCACGGACCCGTGCTGGTCGCGCA CCGACTGGCGTACCTGCTCCGGTTCCTCGCCCACGCGACGGTCGTGGAACCGGGCTGGTGGCGCGGCCCCGACCGCTTCC TGCAACGGCTGCTGGCCGCCGGCCGGGGGCCCGGGGCCCCGCAGCGCGTCAGCAGGCTGCAGCCCCTGGCGCTCTACCTG CTGCAGTTGGAGGACCTCGCCGTCCGAAGGGCCCGCCGCCGGCCGTAGCGGGGCCGGACCGCCGCAGACCCCCACTTCAC GAGACATCAGCCGCAGGGCCCAGAAGGAGCACATCGCATGCGGAAGACATTGCCCGTGATCAGCACAGGTCCCGCCGCGG GAGCGACGTCGGGCGGATGCTCCGCCCCGGCCGAGACCCCGGCCCGGTCGGGAATACCGCTGTGGCGCAAGCGCAAACTG CGGATCGCCCTGGTGCGCCATCACGACCTGTGCCTGAACACCCGTCAGATAGCGCGGGTCCAGAAGCGGGCCGGCGTGCT GCCGCACCTCGGGGCTGGGTTACATCCACACCGCGCTCAAGTCGGCCGGGTTCCACCACGTCATCCAGGTCGACACCCCC GCCCTGGGCCTCGACAGCGAGGGGCTGCGCAAGCTGCTCGCGGACTTCGAGCCGGACCTGGTCGGGGTGAGCACCACGAC ACCCGGTCTGCCCGGCGCCATCGAGGCGTGCGAGGCGGCCAAGAGCACCGGGGCGAAGGTGATCCTGGGCGGGCCGCACA CGGAGGTGTACGCGCACGAGAACCTGGTCCACGAGTCCATCGACTACGTGGGCGTCGGCGAAGGCGTCACGATCATGCCG GAACTGGCGGAGGCGATGGAGCGGGGCGAGGAGCCGGAGGGCATCCGCGGCCTGGTGACCCGCAAGCACGACGGCGGTGC CGCGCCGATGGTGAACCTGGAGGAGGTCGGCTGGCCCGAACGCGCCGGGCTCCCGATGGACCGCTACTACTCGATCATGG CTCCGCGGCCGTTCGCGACGATGATCTCCAGCCGCGGCTGCCCCTTCAAGTGCAGCTTCTGCTTCAAGCAGGCCGTGGAC AAGAAGTCCATGTACCGCAGTCCCGAGGACGTCGTCGGTGAGATGACGGAGCTCAAGGAGCGGTGGGGGGTGAAGGAGAT CATGTTCTACGACGACGTGTTCACCCTGCACCGCGGCCGGGTGCGGGAGATCTGCGGGCTCATCGGGGAGACCGGCCTCA AGGTCCGCTGGGAGGCGCCCACCCGCGTCGACCTGGTGCCCGAGCCGCTGCTGGAGGCGATGGCCGGGGCCGGGTGCGTG CGCCTGCGGTTCGGCATCGAGCACGGTGACAGCGAGATCCTCGAGCGGATGCGCAAGGAGAGCGACATCCAGAAGATCGA GAAGGCCGTCACCTCCGCCCACGAGGCCGGGATCAAGGGCTTCGGGTACTTCATCGTCGGCTGGCTCGGGGAGACCCGGG AGCAGTTCCGCAGGACCGTCGACCTCGCCTGCCGCCTCCCGCTGGACTACGCCAGCTTCTACACCGCGACGCCCCTGCCG GGCACCCCCCTGCACACGGAGTCCGTGGCCGCCGGCCAGATCCCGCCCGACTACTGGGACCGCTTTTCGTGCGGGGCGAG TTCGACGCGCGGATCGGGTACCTGGTGCCGGACGCGCAGGAGCGCGCCCAGTGGGCGTACCGCTCCTTCTTCATGCGCCG CTCCATGGTCAAGCCGCTGCTGTCGCACATGGCGGTGACCGGCCAGTGGCGCAACACGCTGGACGGCCTGCACAGCCTGT ACCGGTCGACCTCCAACACCGACCGTGACTTCTGAGCCCGCCGCCCCGGCCGTCCCGCACCCGCCGGTGCGTCCGGGGCC GCCGGTCCGTCTCAACCGGCCGCTGGCGCGGCGCAGGCGGCGGCCGGCCGGGGAGGGGTTCGTGACGCACCACCTGCGGA GCACCATGGCCCGCGGGTTCCGCCCCCCGGAGTCCTGGGAGGTCCCCGTCCGGCACGTCCTGCCCGGTCTGCCGGCCGAC GGGACTCCGCGCGCCGAGGAGGCCGCTCAGGCGCTGCGCACGCCCGCCGGGCGGCCGGGCATCGCCCTCGTCGTGCCGAC CTACGTCTCCCGGGTGAGCCTGGCGCGGCAGCGGGAGTGGTTCGACGCGCTGCTGGACCAGGCGGCCGCGGTGACGCGGG ACCACCCCCTGGTGCCCCTGGTGCTGTTCGTCGGCATGCAGTGGTCGTCGGCCGAGGAGGAGCGGGAGGCGCTGCGGCGC CTGCGTGTGCTGCTGGACGACGCCCGCACCCGGCTGCCCGGACTGCGGATCTGCGGTCTCGCGCTGCCCGGGCCGGGCAA ACCCCGCACCCTCAACGGGGCGATCGCCGTCGCCGAGCTCCTCGGCTGTGCGGGCGTCGGGTGGACCGACGACGACGTGA CCCTGGAGGAGGACTGCCTGTCCCGGCTGGTGCGGGACTTCCTGGCGGCGGGCTGCCGCGGGGCGGTGGGCGCGACCAAG GTTGCGCACACCCATGAGTACGCCACCTCCCGGCTGCTGTCCCGGGCCAAGGCGATCGCCGCCCCGGCCACGAACTACCC GCACGGCTGCTGCATCCTGGTGGCCACCGACGTGGTGGCCGGTGGTCTGCCGGGACGCTACGTATCCGACGACGGCTACG TGTGCTTCCGCCTCCTCGACCCCGCGCTGCCCGACCCGCTGGCCCGGCTGCGGCTGGTTCCGGACGCCCGGTGCCACTAC TACGTGGCGGGGCCGGCCGGCGAGACCCGCCGCAGGATCCGCAGGCTGCTGCTCAACCACCTCGTCGACCTCGCCGACTG GCCCCTGCCGGTGGTCCGTCACTACTTCCGCCACGTCCTGTTCGGCGGCATGTGGCCGCTGACCGGCTTCGACTCCTCCC GCGGTGCCCGCCGCGGTGTGCAGAAGGCGCTCATCAAGTGGCTCTACTTCGCCTGGTTCGCGGGCATCGGGGGCGAACTC TACGTGCGCGGGCTGTCCGGCAGGCCACTGCGCCGCATCGAGTGGGCTCCCTACTCGGACATCCGCAGGCTCACTCCGTC GTCCTCACCCACGCGTCAGGAGAGCTGATGAAGGTACTGTCGCTCCACTCCGCCGGCCACGACACCGGCGTCGCCTACTT CGAGGACGGGCGGCTGGTCTTCGCGGTCGAGACCGAACGGCTCACCCGGGTCAAGCACGACCACCGCTCCGACGTCGCCC TGCGGCACGTGCTCGAGCAGGAGTGCGTGGACACCGACGGGATCGACCTGGTGGCCGTCAGCACCCCGGTCCGCAGCGGG CTGCTGCGCATACCCGACCTGGACCGGGCCATGGAGCGGATCGGGGCGGGCGCCCTCCACCACCGGACCGTCTGCGAGAT GCTGGGGCGGCGGGTGGAGTGCGTCGTGGTCACCCACGAGGTCTCCCACGCGGCGCTGGCCGCCCACTACGCGGACTGGG AGGAAGGCACCGTCGTCCTCGTCAACGAGGGCCGCGGCCAGCTCACCCGCAGCTCCCTGTTCCGGGTGACCGGCGGGGCC CTGGAGTGGGTCGACAAGGACCCGCTGCCCTGGTACGGCAACGGCTTCGGGTGGACGGCGATCGGGTACCTCCTCGGCTT CGGCCCGAGCCCCAGCGTGGCGGGCAAGGTGATGGCCATGGGCGGCTACGGGCAGCCGGACCCGCGCATCCGCGAACAGC TGCTGTCGGTGGATCCGGAGGTGATGAACGACCGGGAACTCGCCGAGCGGGTGCGCGCGGACCTGGCCGGCCGGCCCGAG TTCGCCCCCGGGTTCGAGACGGCGTCGCAGGTGGTGGCGACGTTCCAGGAGATGTTCACCGAGGCCGTCCGGGCGGTGCT CGACCGGCATGTGACGCGCACGGACGCCGGGGTGGGCCCGATCGCCCTGGGCGGCGGGTGCGCCCTGAACATCGTGGCCA ACTCGGCGCTGCGGGAGGAGTACGGGCGGGACGTCGCCATCCCGCCCGCCTGCGGGGACGCGGGTCACCTGACGGGCGCC GGCCTCTACGCCCTCGCGCAGGTGGCCGGGGTGAAGCCGGAGCCGTTCAGCGTGTACCGCAACGGCGGGGGCGAGGCCCG GGCCGCCGTCCTGGAGGCGGTGGAGGGCGCGGGGTTGCGGGCCGTTCCCTACGACCGGTCCGCGGTCGCCGGGGTGCTGG CCGGGGGCGGGGTGGTGGCGCTGACGCAGGGAGCGGCGGAACTGGGGCCGCGGGCGCTGGGGCACCGGTCGCTGCTGGGC AGTCCCGCGGTGCCGGGCATGCGCGAGCGGATGAGCGAGAAGCTCAAGCGGCGCGAGTGGTTCCGGCCGCTGGGCGCCGT GATGCGCGACGAGCGCTTCGCCGGGCTGTACCCGGGGCGGGCGCCGTCGCCGTACATGCTCTTCGAGTACCGGCTGCCGG ACGGGATCGCGCCCGAGGCCCGGCACGTCAACGGCACCTGCCGGATCCAGACCCTGGGCCCCGAGGAGGACCGGCTGTAC GGTCTGCTCGCCGAGTTCGAGGAGCTGAGCGGTGTGCCGGCGCTGATCAACACGTCGCTCAACGGCCCGGGCAAGCCCAT CGCGCACACCGCCCGGGACGTGCTCGACGACTTCGCGCGCACCGACGTCGACCTCTTCGTGTTCGACGACCTGATGGTGC GGGGCGCCGCCGCGCGGTAGCCCCCGGGGTGGGGCGGGACGGCCGGCCGGAGACGCTCCGGCCGGCCGTCGGTCACTCCC CCAGGTGCCGGGGAAGCAGCCGTACCAGCACGTCGTCCGTGTAGAGGTGGACGACCGGCACCAGACCGGGCGCGCCGGGC GGTGCCGCCACCGCGGCGAGGGCCCGCCCGCGCAGCTCCTCCAGCAGGTCCACGACGTCCTGGCCGGCCACGGCCCCGGT CCGCATCAGATGGGCGAGGTTGCCGTCCCGCTCGCCGTTGCGGTCGTAGTCGGTCAGGTCGTCCGCCATGGTGATGGTCA TGGCGAAAGCCTCTGCGAACTCCCTTACGGAGTCCGCCGGTTGGCCTTCCCCCCCGCAGGCGGCCGCGAGTGCCCCGTAG CGGCCCAGGAAGGTGGAGCCGTAGGTGCTCGCATGGGCGCGCCACTCCCGGAGGTTCGTCGCCCGAGAGCGTTTGGTGCG TATCTGGCCGCCGCAGAGGTGGACGGCGTCCTGCTCCAGGATGTCCGTCACCGCCTTGGGGTCCCGGGCGAGGGATTCCA GTTCGTGCAGCGCCCGCAGGTGGAGGCGGAGGCAGACACAGGCGAGTTCGACCCGGTCGAGTCCGGTGTCGTCGTCCATC AGGTCGTCGAGGAGCTTCATGGAGACGATGTCGAGGGCCAGCGCGCGGGACACCGCGGCCCGCCGGTCCGGGTCGGTCGT CCACTCGGTGAGGAAGTGGGGCACCCTCAGGTACAGGCGCAGGGCGGCGGTGTGCGCCACCAGGTCCGGCGACCCACCGG TCTGCGCGACGCACCGCGTGACATGGTCGCGGTTGGCGGCCTCGGCGGCGAGCATGGTCTCCGTGTAGTCCGCCGGCAGG GCCGTGGCCGGGGCGGCCGTCACCGCCCGCTCCCCGGACGGGCCGGGGCGGCGGGCCGCCTCCCGGCGATCTCGGCGAGG GCGGACCGCCAGTCCGGCTGTTCCAGGGCTCCGGCGAACCCCACGTAGTCCGCCCCGCTGTCGAGGTACTCGGTGACCTG CCGCCCGGAGCGGACGTTGCCGCTCACGAAGAGCACCTGGTCGGGGCCGAGCCCCTTGCGGAAGTGGCGTACGACCTCGG GCGGCACGTGCTCGTTGCGCGAGTACAGGTACACCATGTGGAAACCGAAGGCACGGGCGACGTGGAGGTACCGGTCGATC TCCTCGGTGGAGGCCGTGCTCACCGGCACGGTGCCGAGCAGGTCCCCGGTGCGGGGGTCCTCGCCGAAGGTGAGGGCGAC GGTGAGGAGCAGCTCGGGCCACTCCTCGCGGGGTATTCGGCCGGGGAAGGCGGCCAGCGTCTCGAGGAAGCTCTTCCAGA CGAAGTAGTCGTCGCCCGAGCCCAGCAGCGCGGGCAGCAGGAGCGCGTCCGCGCCGCGGACCACCGGGAAGCCGGCCCCC GGGCGGGGCGGGAAGTGCAGGACGACCGGTAACGGGGTGGCCGCCTTCACCGCCGCCACGTACGGCTCCATGTGCGACTC GAACGACTCGTAGTCGGTGCTGGCCAGAAGGACGGCGGCGAAGCCCAGCCGCGTGAGCTCCGCCGCCTTCTCGACCGCTT CCGTCACCGGGACCTTGAAGGGGTCGATGATGTGGACGGGGCCCGGTTGGTGCTCGCGCAGCCGGGCGAGCACGCGTCCC GGCCGCCAGAGCGGTGGTGCGGCGTGGAGTTCCGTGTGGTGGTCCAGTTGCGGTGAGGCGTTCACCAGCGTCTTCCCCCT TGTCGTCCGGCTCGTCGTCCGGCTTGTCGTCCGGTCGGGTCACGCGACGGGGTGCCGGCCGCGTCGCACGTGCGGATCGC GTCCGGATGAGGTGTCGCGCGTTCGGATGGAGGTGCGGGGCGCCCTGGTCGCCGAGGCCGTTCCCGGCCGGCCGGGAGTG TTCCTCCCGGTGTGCCGCGCCGGCGCGAAAGCCGTGGTCCGGCGCCGTCGCGCGGTTTCCTTCAGACCCGCCCGGGGAAC TGCGTGACCGTTCCGGCACACGCCGCGGTCGAGGGAGTGCGGAAGTGCTCGGAAATCCTTCGGCGGGCCCGTCCGGCGGA TTCACCGGCGGACGGACGAAAAGCGTCGTTCACGTACTCCCCTTCCACTGGAGAGACGAACAGCGGGTCCACCGGGCCGC CTCGAGGACAGGGTGCGGCAGGGCGGTTGCCGATACTACACGCGTTCGTTTCCGTGGGGTAGGGAGACTTTGTGCGGCGG TTATGCATTCCTGCCGGACGGAAGAAGGCACGCCCCGACGGTTTCGCGCCGTGCGGGGCGTTCTCGGCGGTGTCCGGCGT ATTTCACGCGAATTGCAGATGGCGCCGGCGGCGCAATCGGCCCGCCGTCACGCAACCGCTCACCGCGACCAGCAGCAGCG TCACTCCGACGGCGTGGGCCACCCCGGAGTCGAACGACCCGCTGTCCGCGCCGTCGACCGCGTAGGTGATGATCTCCCGG GTCGACCAGAAGGGAAGCACCTTGGCCGAATCCTTGGCCGGATCCATCACCATCTGGGCGCCGATGACGGAGATCAGCAG CAGGGCGCCCTCCATGTCCCGTGGCACGGCCGCCCCGAGCAGCAGTCCCAGCGGCACCGCCACCAGTGTGGTCAGCGCCA GTTCCACCGCGACGGCCCGCGGGTGCGCCACGTCCTGCCCGACCAGGATGATCACGGCGTAGAGGGCGGACACGCCCATG CCGGCGGTGAGGAGGGCCAGCAGGCGGCCCAGGAAGAGCTGGAGCGGGCGGAACCCGGAGAGGGCCAGGAGCGGTTCGAT CTCCCGGCCGCCGACCGCGGAGAAGAGGGCCGCGGCGCTGACCGCGAAGCCCACCCCGAGGCTGGCGAACCGGACCGCCT GGCCGGTCTGGTCGTAACGCCCGAGGTAGAAGACGAGCGGGACCAGGAGCAGCAGGCCCAGCACGCCCCGCCGGCGCAGC AGTTCGCGGAAGGTCATCTCCGCCATCCGCAGGGTGGCCGTCATCGGTTGCCCCTTCCTTTGCCGGGGGTGAGGTCCAGC ACCTGGTCCACCCGGTCGAGCTGGTTGAGCATGTGCGTCACCACGACGACGGCCTTGCCCGCCTCGCGCCACTCCCAGAC GCTCTGCCAGAAGTCCACGTAGGAGCCGTGGTCGAAGCCCTGGTAGGGCTCGTCGAGCAGCAGCAGGTCCGGGTCTCCCA GGGCCGACAGGACGACGTTCAGCTTCTGGCGGGTTCCTCCCGACAGGTCCTTGGCAAGGACGCCCTCCGCGGGGGCCCAG TCGAGCTCTCCCGCGAGTCTCCGGCCGCGGCGGTCGGACTCCCGGCGGCTCAGGCCCCGGCCGGTGCCGAAGAGGGTGAA GTGCTCCCGGGGGGTCAGGAAGCCCATGACCCCCGCGTTCTGCGGGCAGTAGCCGAGGTGGCCGGAGACGGTGACCCGTC CTTTGTCGGGGGAGAGCAGACCGGCGCAGATCTTGAGCAGGGTGGACTTGCCCGTCCCGTTGCTGCCGACGATCGCGGCG ACCTCGCCCGCGTGCACGACAAGATCGACCCCGGTCAGGACGCGGCGGCGCTTGTAGCGTTTCACGACGCCGCGCGCCTG CAGGAGAATCTTGCGGTCGGCGGGCTCGGACATGCCGTGGTACCCCTCTCGGGCACCGACGGAATGGCCCATGACTGCCA CCTTTCTGCCGACCGCGACGAAGGGCACGCATTGTCGGCCTCAATGGTCAGGATGCGTGATCCGGTCGGGTTCATCGCCC CGGCCGCACGCGCACGGCTCAGCCTGCCACACGGCCTGCCCCACATAGCGCGTATCGGTCGGGCCCCCACTTCCCCGAAA GTCCGGGCCCCCGGCCGGTGTTCCGGGGATCCTACGGGGCGACCGGGCGAAGGACTGAAGCCGGGCATCCGCGTTTCGGC CATCTCTCGCCGATACCCGGGGCGCCCTTGTAGGCCGGCCCGGGGCTGGTTAGCGTACCGACCGACCGCAATTCACCGCT CACTCGTGCGTCGCCCGCACCAGCTTTTCCCTTCTTCCGGAGTCCGCCGCCGGGGCGGGAGCGGGCGGGACCGCACCCCG TTCAGGCAAGAGGGAAATCCGCTCGGAATCGACGAAGGGGACGTGCATGCGCGGGGGGACCGTGGACCGTCGTGTCTGGT GGCAACGGGCCGTGGCCCGCGGTTTCGCGCCCACCGCCGGCGCGGCCCACCCCGTTCGTCCTGGTGGGACCCGAGGGACC GGACTCGGACGTCCGGGCGAGGTGCGCGCGGACGGCGTGATCGGCGCGCGGCCGGGCGGGGCGGCGTTCCGCTGCGTCCG GCGGGCTCGGGGTGCTGCTGCCACTACGTCGGCAGTGCGACGCGAGCGCACGAGCAGACGTCGTCATGTGCTGCGCCGTC TGGCCGAGGTGCGGGAAGCGCACCCGTCCCTGCCGCTGACCGTCTGGGTGGGCATGCAGTACGGCCCCGGGGAGGACGAG GAGGCGCTGCGCAGGCTGCGCCGGCTGTGCGCCCCGGTGCCCGGGGGCCCGGCCCTCACCGTGGTCGGCCTGGCCCTGCC CGGGCCGGGCAAGCTCCGCACGGTGAGCACGGTCCTGCGGCTCTCCGAGGACCTCGGCTACGCCGGCTGGCTCTGGACGG ACGACGACATCGAGATCGCCCCCCACTGCCTCGCCCTGCTGGTCTCCCGTTTCCGGGAGCGGGGGGAGCGGGGCGCGGTC GGGGCGCATTCGGTCGCGCTGGCCAGGGAGACGGTCACCTCACAGGCCATGGACCGGGTCTCCGGGGTCACCGCCCCGCC GAAGGCCTGCCCGGCGGCGGCCTGCCTGGTCGTCGCGACGGACGTGCTGGGCACCGGCATTCCGGTCAGGCGCCTGACCG ACGACGGGTACGTGGTGTTCGAACTGCTCGACGCCGGGGCGCCCGATCCGCTGCACGACCTGGAGGTGCTGCCCGAGGCC CGGATCAGCTTCTACCGCGTCAGCCGCACCCACGACACGTTCCAGCGCCTGCGCCGCTCCCTCTACAGCCATGTGACCTG CGTCGCCGACTATCCCTGGCCCACCGCGCGGGTCTACCTCACCCGGGTCCTCTTCCACGGTCTGTGGCCGCTCGCGGCGT GGGACGGCAGCCGGGGGCCGGTGCACGGGCTGCAGCGCTGGCTGGTCAAGGGCCTGCACTTCACCTGGTTCTGCGGGGTG GCCGGCTCGCTGGCGGTCCGGGGCGCGGTGGGACGGCCCCTTCGCCGGGTGGCGTGGGGCGACGAGGGGGACTTCCGCAG CCCCACCGTCGAGGAGCCCGCCGCGGGAGCGGCCGCCGGGCGCTGACACACGAGGTCACCCCGAGGGGCGGCCCGGAAGG AGACGCGATGGTGACAGCGGGGCCGGCCGGGGCGGCGGTGACCGTCGTCCTGCCTCACTACGACTGCGCGGCGTACCTGG GTGCGGCCGTCGGATCGGTGCTCTCCCAGGACCGCCCGGACCTGCGCCTGACGGTGGTGGACGAATGCTCGCCCGAAGAG AAGTGGGCCCGCGCACTCCACCCGTACGCCGGCGACCCCCGGCTGACCGTGGTCCGCACCTCCCGCAACGTCGGCCACCT GCGGATCAAGAACAAGGTCCTGGAATCGGTGGACACCCCCTACGTGGCCTTCCAGGACGCCGACGACATCAGCCTGCCGG GCCGGCTGCGCCACCAGCTGGCCCTCCTGGAGAGCGGCGGCGCCGATCTGGTCGGCTGCGCCTACTCCTACATCGACGAG GCGGGCCGTACGACGGGACACCGGCGGATGCCCCGCAACGGCAACCTCTGGATGCGGCTGGGGCGGACGACCGTGCTCCT GCACCCGTCCTCGGTGGTGCGGCGCTCGGTGCTCGAGAGGCTCGGCGGCTTCGACGGCACCGCGCGCCTGGGGGCCGACA CCGACTTCCACCTGCGGGCCGCCCGCCTGTACCGGCTGCGCAGTGTGCGCAAGGTGCTCTACCGGTACCGGATCTGGCCC AAGTCGCTCACCCAGGCGCCGGACACCGGGTTCGGGTCCGCGGAGCGCCGGGCCTACACCGAGGCGATGACCGCGCAGGA GGAGCGGCGGCGACGGGCGCGGACCCGTGAGGAGCTGCTGCCGCTGCTGGTCGCCCCGCCCAACGACGTCGACTTCACCC TGACCCGGGTCGACCTCGACTAGCCGACGGAGGGGGAACGGCGTGGACGGCACCTCGGCGAGGACCGCGGACGAGGCGTT GCCCGGGGTCGCGGTGGTGGTGGTCGATCCGGACGGCGACGGGCGGCGCGCCGTGCGCGGCCTCCTCGCCCAGACGGTGC GTCCCGTCTCGATCACCCTGGTGACGGCGGCCGGCCCGACGGCCGGCGGCACCCGGTCCCCCGGGCCGGCCGTGCCCTTC GACGACCCGGCGGTGAAAGCCCGTACGGGTCGTCCGGTGCGCTCGCGGGGACCTCGGGCCGGCTTTGCGTCGACGCGGCC AGGAACGCGGGGGCGCCGTACGTGGCCGTCCTCCGCGGTGACGACGAGGCGCTCCCCCACTGGCTGTGGCACCTGGCGCG GGCGGTCTGGTACGGCGGCGGGGACGGCACCGGGCCGGTCGGCCTGGTGCAGTGCGGCGCCCTGCGGCTGAGGGACGACG GCCTGGTGGACGGGTTCGCCCTGCCGCCCGCGTCCCCGCGGACCCGGCCCTCCCCCTCGGACCTCCTCGAGGGCGCCTAC GCGGTGCGGCGCGAACTGCTGGACGCGGACGGCGGTACGGCGCCCTGGGTCGCCCTGCCCATGCCGCTGGTCCGCCGCCG GTCCGGCGGCGCCGGGGACCCGGCCGCGGTCCTGGCCCCCGGGACGCGCGTCGCGCGACGCACCCGCCTGGTCCGGCACG GGTACCGGCCGCCCGCCGCGAGGCCGCGGAACGGGAGCACTCCCCGGCTGGTGTCGGTGGTCGTCCCGGTGCGCAACGGC GCCCGCACGCTCGCCGCCCAGCTGACCGCCCTGGCCCGGCAGACCGGAGCCGTCGCCTACGAGGTGCTGGTCGTCGACAA CGGCTCGACGGACACCACCCGCGAGGTCGCCGAACGGGCCCGCGCCGAGCTGCCGGACCTGCGGATCGTGGACGCGTCCG ACCGTGCCGGTGAGAGCTGTGCCCGCAACCGGGGAATCGCCGCGGCGCGCGGCGACTTCGTCGCGTTCTGCGACGCGGAC GACGTCGCCGACACCGGCTGGCTGGCCGCGATGGCCCAGGCGGCCAAGGAGGCCGATCTGGTGGGAGGCGGACTGGAGAC CTCCGTGCTCAGTCCCGGCCGCGTCGACGAGCAGCCCCTGCCGATGGACGCCCAGACCGATTTCCTGCCGTTCGCCCGGG GGGCGAACTGCGGTGCCTGGAAGGACGTCCTGACCGCGCTGGGCGGCTGGGACGAGCGCTACCGGGGCGGCGGGGAGGAC ATGGACCTCTCCTGGCGCGCCCAGCTCTGCGGTTACCTCGTCCGCTACGCGGACGACGCCCGGATGCACTACCGGTTGCG GGACGGACTGCCGGCGCTGGCACGGCAGAAGTGGAACTACGGCCGTTCCGGGGCCCAGTTGTACGCCGCGTACCGGCGCG CCGGGTTCGAACGGCGCGACGGCCGGGTGGTCGTCAGGAACTGGTGCTGGCTGCTGCTGCACGTTCCGAACCTGGTCCGG TCCACCGGACCCTGCGGCCACGCTGAGTCCGCTACGCGCCCGGCTGGCCGGTTTCCTGGTTTGTGAACGTGCGGCAGGGC GTCAGGTCCTTGTTGGTGGGCGGGCGTCCGGCGCCCGCGGGACGCCGGGCCGGCACCGCGGTGGCCCGGCGCGCCGCTCC CGGGTTCAGACCAGCCGGTGGCCGGGGTCCTGCGCCACCGGGTCGTCGCCCGCCATGGCGAGGCAGGTGGCGCGCAGGGC GGCGACGACGGCCTGGTCCTCGCCCCAGGCGTCGAGTTCCGGGCCGTCCCCCGCCTTCAGGGCCGGCACCGGCACTTCCA TGATCTTCGGATGCCCGTGCCGGACCGGGGACTCGGAAGGGGCCACCAGCGCCTCGGTGAGCTTCTCCCCCGGCCGCAGC CCGACGTAGCGGACCGGGAGCTCCGCACCGGCGTGCGCGATGAGCCTTCTGGCGATGTCGAGGATCCGGACCTGTTCCCC CATGTCCAGGACCAGGGCGTGGCCGACGCTGCCCAGCGCGACCGACTGGATGACCAGTTCCACGGCCTCCTGGACGGTCA TCAGATAGCGCGTCACCTCGGGGTGGGTGACCGTCACCGGTCTGCCGGCCGCGATCTGCCGGGCGAAGACGTCGAGGAAG GACCCCTGGCAACCGAGCACGTTGCCGAAGCGCACGCTCACGTACGGTCTGCCCGCCTGGATCGCGGCCGCCGCGGTGAG TCCTTCGGCTATGCGTTTCGAGTATCCGAGCACCCCGACCGGATCGACCGCCTTGTCGGTCGAGATGTTCACCAGGAACG CGACGTCCGCGGCCAGGGCCGCCTCGAGCACCGCTCGGGTGCCGAAGACATTCGTCTTGACGGCTTCCCCGGGGAACTTC TCCAGGATGGGCACCCATTTGAGGGCCGCCGCGTGGAAGACGGTGTCCGGCCGGCACTGCTGGAACAGCCGGGCGAGCCC TCTGGAGTCCCTGATGTCCGCGAGGAGGATGGAGGTCCGCACCGACGGGGAGACGTTCCCGATGCTGGTGGCCGCCAGGT GGAGGGCCGTCTCGTTCCGGTCGAGCATCATGAGGCTCTCGGGTTCCCACCGGCTGAGCTGCCGGCACAGTTCCGATCCG ATGTAGCCGCCGGCTCCGGTGACCAGGATCCGTCGGCCGCGCAGTAATCCGGCGCTGCTCTCGAGACCGGTCCTTATTCG TTGGCGGCCGATAATCCTCTCGAGGTCCAAgGTGAGAGGTCCACCGGCCGGAAAGTTCGCGTCGTACCCCACGGAATTAT CGCCAAACATGCAGTCACACTTCCTTTTTGACAAGAGTCATGACTGACGTGCCGACCCACACGACGAACGGGACCGACGT ATCGTCTTGGTGCTTTCCTCACCGGCACCACCGCGTTCCCCCACCGGTGCCTGCGCACGGGGATCACATTCCGGCGGCCG GGTCGCCACCCGCTGCGCCGGCTCCGCCGACGTCGGACGTGTCCTCTTCCGACACCCCAGGACGACCGCGAAAATCACTT TATCGAGGCGCGGCGCGGTGCGGGCGGGTTTTCTCAACGGACGCCCCGCGTCACCGGAACGCCGGGGCCGAGGAATTCGC GCGCGCCCCGCAACCGGGTCCGGAGCCGGGCCCGCGTGGCATCGGTGACGAGACGGATGTCGAGGTTCCCCGGCGGGGCC ACCTCGCTCCGCAGACCGGCCGGCAGCCGGGCCGGGTCCAGTCCGTCCCGCCGGGCTATGAGGACACCCAGGTCGTGACG GTTCATCGCGTCCGGTCCCGCCACGTGGAACACCCCGGACCCGTCCGACGCCGCGATCTCCAAAAGCGCGGAGGCCAGAT CGTCGACGTGGACCGGACAGCGGACGTCGTCCGTGAACAGGACGCCGGCGCGCCGGCCGGCCGCCAGGGCGTGCACCGCC TCCTCGTGGGCGGACCGGTTGTGCCCCACGATGAGCGAGGTGCGCACCACGGCGGCCTCGGGCACGGCCACCCTGACGGC CGTCTCCGCCGCGGCCTTGGCCGCGCCGTACGGGGAGACGGGGTCGGGGAGGGCCTCCTCCGGGTAGTGGACGTCGGCTC CGGAGAACACGGCGTCGGAGGAGACGTGGACTAGTCGGCAGCCGGCGCGCGCCGCCTCCAGGGCGAGGCGGGCCGCGCCG TCGGCCGTGACCGCCCAGTCGGCGTGTCCGCTCGACGCGTTGATCACCGCGGCCGGCCGGGTCCGGGCCAGCACCTCTCC CATCCGCCCCGGGTCACGGAGGTCGGCCCGGTACCAGGTGACCGGCGGCAGTTCCTCGGGGCGGGTCCGGTAGGTCGCGG CCACGTCCCACCCGGCGGCCACGGCCCGGCGGAGCACCTCGTACCCGAGGAAGCCGCTCCCGCCGACGACAAGAACTCTC ACGCACCGCCCCCCTGACGTCCGGCCCGCCCCCGATCGCGCCCCAGAAGTACGGGGACGACGGCCTGTGCGGCCGTGTCG AGCCGTGGTTCCTGTGGGCCCGATGCTACTGAAGGCCACCGAGCGGCGGGGAGAGTGAGTGCTTTCCGTCTTCGGTCCGA ACCGGCCGGGGAGGCTCCGCAACGGCTCCCGCGGGCAGGAGGGCGGCGGCCCCGAGCGGCCTTGCGCACGGCGGGACGCC GGGCCCGCGGTCCGAGGCGTTCCCCCGGGGCGCGCAGCCGGTGCCACCGGCGGGGCGGCGACGGCCCGTCCCGCTCCTGG TCCCGGGGCGGGCGGCGTGTGACCCGCCCGCCCCGGCCGAGCGGGTGCGTCCCCGTCAGGCACGGCGGGGTGCGCCCCGT CAGGTACGACAGGGAGCCGGGTTCTCCGGCCGGGCGGGGATCCCCGCCCGCACACGGGCGAGCAGGGCGGCCGGGTCCGG GTCCGCCTGCAGGGACCGCAGCACCAGCGTCCACCACTGCTCCAGTCGCTCCCTCCAGTCCAGCCCGCCCGGCATCTCGT CGGTCAGGGTGCAGAGTCCGAAGAACGCGCAGACGAGCGTCACGGCCGCGGCCGACGGCTCGACCCCTTCCGCGAGTTCG CCCCCGGCGCGGGCCTCGGCGAGGAGCCGGGTGGCAGCCGCCGCCCAGGCGTCGAACGGCGGGGGCACGGCGGCCTCGAT GGTGTGGCGCTCCGCCCACAGCCGGGCACCGGCACGTGCCACGACGTCCTCGCTGAGGGACTGCGCGACCCGGAAGCTGA GGCCGACCAGCTTCTCCAGCGGAGGAACGCCGGGCGTGGTGTAGGCGGCGGCGAGTTGCGGCCAGGTGGCGAACTGCTCG CGGACCACGGCCAATGCCAGCTTCTCCTTGCTGGAGTAATGGAAATAGATCGCCCCGCTGGTTCTTCCCGAGTGATCGCT TATGTCATTGACGCTCGTTCCGGCATATCCCTGTTCAACGAACAGATGTGCCGCTGTTTCCAGCAGCACTTTGCGGGTTG CACGCGCCCTGTCCTGCACTTCTGCTCCACCTTCGCTCACACACGCCGACGCCACGACGGAAAAGTCCAGGCGCCCCCGG AGGCAGGGTCGCAGCGCGCGAAGATCTTATTATTCTTCGGCCGTGTGCGCCGCAGGGGCGACTTCACACAACACGCCCCT CTGTCCGGCCGAATCGATTCGGGCGGGCGCGGGAACTCCGCCCCGATCAATGCCGCGGCACCCCGGAGGAGCCTCCCACC AGGCCCGTTGGCCGCGTTCCGGGCGCTGCGGCCCCTCCTCCCGTTCCGTGTGCGGGAGTCGGCCGCCTTCGGCCGCGGCA TGCACATACACCCGAATCGGTGATTGTGGAAATCGAAGAAGCGAAATTAACATAACGGGCACGATGTTTTTCGGCGTGGT CGAAAGCGACGACCCGCGCCCGCCATGACGCGCCCACGGCGCGCTCAGCCGTGTCCACGTGGCCGCAAACGGCCGTGCGA CATCACCCGTTTCCGCTCATTTCCGCGAGTGGACCACCCGCATCCCTGCACCCGCCTGCGCCCGTGCACCGAACCGGTCG ACGCAGCCTCCCAGAACGGCAGTCACATGACAGCTCACCGCATCCTTTCCTGGTCCCCCTCCGCCATCGTCTTCGACTGC GACGGAACCCTGATGGACACGGAACGACACTGGCAGGAGGCCCGGAACCTCACCTTCCGGGCGTTCGGCCTGAAACCGCC GGCCGGGTTCGCCGACCGCGCCAAGGGCATGCACTACACCGAGTGCGGAGCGCTCATGGCCGAAGAGACCGGGAAACCGG GCCTCGTCGGGGAGTTGACCGACACGCTCCTCGGCACCTTCACCACCCTGGTCGACCAGGACCCCGTCACCATGCCGGGG GCCGCCTCGCTGGTCCGGCTGGCCTCTCGCCACCGCCCTCTCGCGGTGGCGAGCAACTGCCCCCGGGAGGTGGTGGAATC ATGCCTCTACCGGGCCGGGCTCCTCGACTGCTTCGGCCACGTCGTGGTCGCCGGCGGGGAGGTACGGCCGAAGCCGGAAC CCGACGTCTACGCGGTGGCCGCCCGCCTCTGCGGCGTCCCTCCCGAGGAGGCTCTGGCCGTGGAGGACTCGCTCACCGGT ATGGAGTCGGCCCGCCGGGCGGGCCTTCGCGTCATCGGCATCGGACCGTGCCCACCGGGGCCGGAGGCGGAGAAGGCCGA TCTGTGGGTCGCGAGCCTCGCCGACGGCGAGCTGCTGTCGTGGGCCCGCACCCGGATCGGCGAGTAGGACACCCGGGGCC CCGTCGCACGGGGCCCGGGCGGGAGGGGCGGACCGCGGTTCCGTCAGCGGAGCCGGGCGGCGAACGCCGCGTACGCCTGC TCGTCGAAGAGGACGAACCGGATCTCCTCCACCGCGGTTTCGGCGTCGCGCACCGTCTGCACCGCGATGCGCGCGGCGTC CTCCATCGGCCAGCCGTAGACACCGGTGGAGACGGCCGGGAACGCGACGGTGCGCGCGCCGAGTCCGTCGGCGACCCGCA GCGACTCCCGGTAGCAGGAGGCCAGCAGCGCCGAGCGGTCCTCCTCGCGGCTGAACACCGGGCCGACGGTGTGGATCACC CAGCGGGCGTCCAGGTCGCCGGCGGTGGTGGCGACGGCCCGGCCGGTGGGCAGGCCCTCGCCGTACCGAGAAGCGCGCAG GCGGCGGCACTCCTCCAGGATCGCCGGGCCGCCCCGGCGGTGGATCGCGCCGTCGACGCCGCCTCCGCCGAGCAGGGACG AGTTCGCCGCGTTGACGATCGCGTCGGCGCTCTGGCGGGTGATGTCGCCCCGGACGAGGGTGAGGGTGGCGCTCATGTCT GCCGCAGCCTCCTCCAGACGGCCTTCGCCGCGTTGTGTCCCGACATGCCGTGCACCCCGGGGCCGGGCGGGGTGGCCGAG GAGCAGATGAAGACCGCGGGGTGCGGGGTTGCGTACGGGAACAGGGACGGTCTGGGGCGCAGCAGGAGCTGGAGTCCGGA GGCCGCGCCGGTGCCGATGTCGCCGCCGACGTAGTTGGCGTTGCGGGCGGCGAGTTCGGGCGGGCCGGCGGTGGCGCGGG CCAGGACGCGGTCGCGGAAGCCCGGTGCGTAGCGCTCCAGCTGGCGCTCGATGGCGTCGGTGAGGTCTCCGGTCCAGCCG TGCGGGACGTGGCCGTAGGCCCAGAAGACGTGCTGGCCCGCCGGTGCCCGGGTGGGGTCGGCGACGCCGGGCTGCACGGT GATGAGGAACGGCGCGTCGGGGGCCCGGCCCTCCCGGGAGGCGGCGCGCAGGGCGGCGCCGATCTCCCCGCTGTCCGCGC CGATCTGCACGGTCCCGGCGACGCGGGCCTCGGGCGCGGTCCACGGCACCGGGCCGTCCAGCGCGTAGTCGATCTTGAAG ACGCCGGGGCCGTACCGGTAGTTCGCGTAGGTGCCGCCGAAGCCGGCGATGCGGGCCAGGGCGGTGGGCGAGGTGTCGAA GACGTAGGCGCGGGCGGGCGGCAGGTCGTCGAGGCGCTTGACCTCGTAGTCGGTGTGGACGCTGCCGCCGAGGTCCTTCA GGTACGCGGCGAGGGCGTCGGAGAGCGCATGGAGCCGCCGCGGCCACGGCCAGCCGCGGGCGTGCGCGGCGAGGGCGAAG ACGAGGCCGACGGCGCCGGTGGCGAGACCACCGAGGGGGGCCATCACATGGGCGACGAGCCCCGCGAACAGGGTCCTGGC CCGCTCGTCGCGGAAGCGGCGCATGAGCCAGGTCGAGGGGGGCAGGCCGACCAGGCCGAAGCGGGCGAGGGTGACCGGGT CGCGGGGCAGCGCGGTCAGCGGCAGGGACATGAAGTCGCGGACCAGGGTGTCCCACCTGGACAGGAAGGGTGCGACGAGC CGTCGGTACGGGCCGGCGTCGCGCGGGCCGAAGGAGGCGGCCGTCTCGCCGACCGACCGGGACAGCACGGCCGCGGTGCC GTCCGGGAAGGGGTGCGCCATGGGGAGCCGGGGGTGCATCCACTCCAGCCCGTAGCGCTCCAGCGGGAGGGCGCGGAAGG CGGGCGAGTTGATGCCGAGGGGGTGCGCGGCGGAGCACGGGTCGTGCCGGAAGCCGGGCAGGGTGAGCTCCTCGGTGCGG GCGCCGCCGCCCACGGTGTCCCTGGCCTCGAACAGGGCCACCGAGAAGCCGCGCCGGGCCAGCTCCACGGCAGCGGTCAG CCCGTTCGGCCCCGCACCCACCACGACCGCGTCGAGCATCGACGGCACCTTCGGACTCCTTCGTCAGCCGACGGCCACTG GCATCAGGATATGCCGGGGCGCCGGTACCGGGAGATCAGGCTCCTTCCGACAGCAGCCCCACCACCCGCTGTGCCGTGGC CGCGTCGCGGGCCGCGGTGAAGGGGAGGGTGTTGCCGCCGGTGATGCGGAAGGGCTCGCCCGCGCGGGTCAGATGGGTGC CGCCCGCCTCCTCGACCAGGAGCAGACCGGCCGCGTGGTCCCAGGCGGCTTCCCAGGAGAAGGCGGTGGCGTCGGACTCG CCGCGGGCGACGGCCAGGTACTCCAGGCCGGCCGAGCCGCAGGGACGGGGTGCCACGCCCTCGGTCCGCAGGGCGAGCAG GGACCGCTTCTGTTCGTCCGTGGTGAAGTCCGGGTGGGAGGTGGCCACGCGCAGGTCGCGGCCGGGTTCCGGGGAGCCCG CGCGGAGCCGTTCGCCGTCGAGGTGGGCGCCCTTGCCCCGTACGGCCGTGGCGAATTGGTGGCGGGCCGGGGCGAAGGTC CAGGAGGCGTACAGGACTCCGCGCCGGGCAAGGGCGACCAGGGTGCAGAAACCGGTGTCTCCGTGCACGAACTGCCGGGT GCCGTCGACGGGGTCGACTATCCAGACCGGCGCCTCGCCCCGAACCGCCTCGTACGACGTCGGGTTGGCGTGCACGGCCT CCTCGCCCACCACGACCGAGCCGGGCAGCAGGGCGGTGAGCGCCTCCGTGAGGTACAGCTCCGCCTTGCGGTCGGCGTCG GTCACGAGGTCGTGCGGGCCGCTCTTCAGGTCCACCTCGTGTTCGGCGAGCCGGCGCCAGCGCGGCATGATCTCCTGCGC GGCGGCCTTGCGGACGGCTTCCTCCACGTCGACGGCGTGCCGGTCGAGAAACTCTTCGATGGTTTCGTTGTCCTTGATCA TGCCTCCATGAGACCACGCCCGGCCGACGTTCCCCACCGTCCCGGTGCACTGCGGGGGGAATCGGCATGAATATGGGGTG CCGGACCACGGGCGGGTGGGCTCAGCGGCCCACCGCGTAG (SEQ ID NO: 1)

TABLE 3 Sequence of Moe Cluster 2 GACGAGAACGGAGTGCGCTGTTTCGGACGGGCGCGTCCGTCAGCGCGACGGAATGGACGGATCACGTGACCGACTTGCTG GAGCCGAGGCAACACTGGGTTAGGCGGTTACACCCTTCACCGGACAGCGATGTCACGGTGGTCTGCTTCCCGCACGCGGG TGGATCGGCCAGCTACTTCCACCCGTTGTCCGCTCGGCTGACGCCCCGTGCCGAGGTGCTGGCGCTGCAGTATCCGGGCC GCCAGGACCGCCGGTTCGAGCCTGCGCTCACCAGTATCGACGAGCTGGTGGAGGGAATCACCGAGGCGCTGCGCGAGCAC GTCGACCGGCCCCTCGTGTTCTTCGGGCACAGCATGGGCGGGACGCTCGCCTTCGAGACCGCGCGGCGCATGGAGCCGGA GCTCGACGGGCGGTTGCTGGGGCTGGTCGTGTCGGGGCGCAGGTCGCCCGGCAGCGTGCGCCGGACGACGGTGCATCTGC GGGACGACGCGGGGCTCATCGCGGAAATACGCGAACTGCAGGGGACCGCCTCGACGTTGCTGGACGACGAAGACGTGGTG CGGATGATCCTGCCGTCCCTCCGCGCCGACTACACCGCGGTGGAGCGGTACGTGTACCGGCCGGGACCGGCACTGAGTTG CCCCCTGTACGTCTACACCGGTGACGCCGATCCCCAGGTGAACGAGGAGGAGGCGGCGGGATGGGCGGAGCACACCCGCG CGGACTTCCGGATCCGCACTTTCAGCGGCGGTCACTTCTACCTCGCCGAGCAGAGCGAGCAGGTGATCGCGGCACTGCGT GAGGACGTGACGGGCTTCCAGGAGCGTTCCCGGACCGGCGCCGAGCGCTGATCCGGGCCCGGGAAGGGTGCACGCACCGG ACGTGAGGCCGTGCAGTTCAGCCACCGCCGGCGAAGCGGGCGGCGAGTTCGCGTTTCAGTACCTTGCCGCTCGGCCCGAG GGGGAAGTCCTCGACGAACTCCACCCGGCGCGGGTACTTGTACGCGGCGATTCGCTGCCTGCTCCAGGACACGATGTGCG CGGCCAGCGCCGCGTCCGGATCCGTGCCCGGCCGCGTCCGCACCACGGCGCACACCTCCTCGCCGTACTTGTCGTCGGGG ACACCGATGACGGCAACCTGGGCGACGGCCGGGTGACGCATCAGCACCTCCTCCACCTCGCGTGGATAGACGTTGTAGCC ACCGCGCAGCACCATGTCCTTCTTGCGGTCGACGATGGTCAGATAGCCGTCGGCGTCCTTCATCCCCAGGTCGCCCGAGC GGAACCAGCCGTCGACCAGCACGGCTGCGGTGGCTTCCGGCCGGTTGAGGTAGCCGGCCATGACGTTGTGGCCGCGTACG ACGATCTCCCCGATCTCCCCGGCCGGCAGCAGCTCGATACGGTCCTCCACGTCGGCGGCGGCGATCTCCGCCTCCACGCC CCAGATGGGGCGCCCCACGGTGCCGGGCCTGCGCGGCCACGCCTTCTGGTTGTACGCCACCACCGGCGAGGTCTCCGTGA GGCCGTACCCCTCGTAGATCGGGCAGCCGTAGACCTCCTGGAACTCCTCGAGCACCTTGACCGGTAGCGCCGAACCGCCG GAGAAGGCGCGGTCGAGCACGGGGCGGCGGGCGTCGTGAGCGGCGGCGTCGAGGAGGGCCAGGTACATGGTCGGGACGCC CATGAACACCGTGCAGCCCTCGGTGACCATGAGGTCGAGCGCGCCGGGGCCGTCGAAGCGGTTCATGAGCACCAGGGTGC CGCCGGCCAGGAAACAGGCGCTCATGCCGCAGGTCTGGCCGAAGGTGTGGAACAGCGGCAGACAGCCCAGCAGCACGTCC TCGGGGCCGAGGTCGAACGGCGAGCGCATCGTGGTGCTGACGTTCATCACCAGGTTGAGGTGGGTGATCATCGCGCCCTT GGGCCGGCCGGTGGTGCCCGAGGTGTACAGCACCAAGGCCAAGTCGTCGGGCGCGCGCGGCACCAGACCGTCCAGGGGCT CCGCCCGTTCGGCGAGCACGTCGAGGCGTGCCGGGCCGTCGTCGTCCTCGCCGTTCTCGACCATGACGGTGAGCAGCGGA ACCCCGGCCGTCCCGGCCGCCTTGGCGCCCTCGGTCAGCATCGGGGCCGCGCACACCATGGCCTTCGCCTCGGAGTCGCC CAGCACGTGGACGATCTCGTCGGCACGCAGCAGGCCGTGCACCGGGACCACCACGGCACCGAGCGCCAGCACGCCGTAGT ACACCATCGGGAAGTGCGGTGTGTTCGGCAGCAGCAGGGCGATCCGGTCGCCCGGGCGCACACCGCGGTCCCTCAGCACC GCCGCGTACCGGCGGGTTGCGAGCCAGAGCTCGGCGTAGGTGATGCGTTCGGAGCCGAAGACGAGCGCGGGGTGGTCGGG GCGTCGCCCGGCGGACTCGGCCAGTACGGACGCGGCGGTCAGGGTCATGCCGCACCGTTGTGCCGTGCGGCCAGCGCCGG CTTGTCCGGTTTTCCGGCACGGGTGAGGGGCAGCGCGTCGTGGAACGTCACCACGGCCGGTACGTGCTTCGGAGACAGCT CGGCGGCGACGTGGCCGATCAGCGTCCCGGAGTCGGCGGTGCCGCCCGGCCGTACCACGACGGCGGCGTGGATGTGCTCC ACGCGGTCCTCGTCGACCACGCAGTACACAGCGGCCTGGGTGACCTCCGGATGGGTCAGCAGCGCGTTCTCCACATCGGT GGGATGGACCTTGATGCCGTTGGTCTTCATCACCTCGCCCATGCGGCCGTGCAGGCGCAGAAAGCCGTTCTCGTCGAGGG AACCGAGGTCGCCGGTGTGCACCCAGCCGTCGCGGATGATCGCGGCGGTCAGCTCCGGTTCGCCCCAGTAGCCGAGCATG GTGGACGGGCTCTGCACGCACACCTCGCCGATCTCGCCGGGCGGCAGGTCGCGGTCGTCGTCCACGTCGCGGATGCGTAT CTCCGTGGTCGGAGGTCCGACGGTCCGGCGCAGTTCCGGGTCGAAGTGGTCCTGCGGCATCAGCATGCTGATGCCGTTGA CTTCCGTGGTCCCGTAGAGCTGGAGCAACACCGGGCCGAACACCTCGACCGCCTCGGCCAGTCGGGCGGGGGCCGCGGGG GAACCGAGGTAGGTGATGAGCCTGATGCTCGAACGGTCGGTGGTGGCGGTGTCGGGGTGGTCGATCAGCATGTACAGCTG CGGCGGGGTGATGGTCAGCGTGGAGACGCGGTGCTGTTCCACGGCCCGCAGCACTTCGCCCGCCTCGAACCCGTCGTGCA GGACGACCGTTCCGCCGGAGGCGAGCGCGACGTCGACGGCGGAGCCGCTGGAGTTGCTCACCGGCAGGGTCGACAGGTAC ACGATGGGTTCGGGGGACTGGAGGGCCACTTGGAGGTTGGCACGGCGAAGGCGGTACGGCTGCGTGACACCCTTGGGACG TCCGCTGGTACCGCTGGTGTAGATCACCACGGCCGGCTGTTCGGGGTCGGCCTCGACGGCGTCGTGGCCGAAGGCGTCCG GGTCGCCCGACGAGAGGTCCAGGACATCGGGGCCGAGGGCACCGAGAGCGGCGAGACGCGGTGGCTCGGGCAGCCGGTCG CACAGCTCTCGGGCCGCGTCGAGGTTCTCCTTGTCGACGGCGAGGAAGGTCGCCCCGGTCTTGCTGAGAATGTCCAGCCG GGCGGCGGCGGCCAGCTGGTCGGTGGGGTCCACCGCGTTCGTGGAGTGCAGGTGGACCAGGGTGGCCCCGGCCAGGTTGG CCGCGTAGCGGAGGATGATGGTCGCCGGGCTGTTGGTGACGGTCAGCACCGCCACAACCGGGGCCTTGCCTTCCGCACTC GGGTCTCGATGTTCCGTGAAGTGCCGGAGGAGAAGTTCCGCTGCCGTGAGAACCGCCCTGGAGACCTGGCCCGCGGTGAT TTCTTCACCATCCGCCCACAGGGCAATCCGGTCGGGGTCGGAGGCCAGCGCCTCAAGCACCCGGCGGACGTAATTCTCGT TCGAGGACATCGTTCCCCCACCATGCTGGTTCGTTTATCGGTCAGTGCAGACTTACATGATCGCGCGAAAGCGCGACAAC CCGCTCTCGGTAACCATTGGGCGTGCGGCCGGTGAGCACGGCTGCCGTCGGCCAGTTCTCAACACCCGGCAGCCTTGGGC GAGTTGAATGCCTGCCGGAGTCGATGATATACAGACGTTACCTTCATGCCCTTCCCCTGTGTTGCGAATGGTGAGGCCGC TCCCGCGCGATTTCGCCAGTGACACGTTCGCACCGGCGCCGGGGACAAGCAGAATCCAGTCATGGCCGTGCGATTTCAGC CAGTTATGCGCGGTTGCCCGTAGTTGCATGTAGCCTCAGACGGCCTGGAACGAAGCGAGTAGACGTGACGACCCAATATC TGGATCTCTTTGCACGCCTCACAGAAAACTCCGACGGGGGAAAGAGGGAGTTCCTGGAGATCGGACGGCTCGCCGGGAGC TTCCCCGCGGCCAGCGTCCGCAGCAGTGGACCCGTGACCGGCCGGGACAGCATCAGCGTCTGGTGCAGCAACGACTACCT CGGCATGGGCCAGCATCCCGCAGTGCTCAAAGCCATGAAGGACGCGATCGACGAGTACGGCGCCGGCGCCGGCGGCTCAC GCAACATCGGCGGCACCAACCACTACCACGTGCTGCTGGAGAGAGAGCTCGCCGCGCTCCACGGCAAGGACGAGGCCCTG CTGTTCACCTCCGGTTACACCGCCAACGACGGTGCGCTGTCCGTCATCGCCGGCCGCATGGAGAAGTGTGTCGTCTTCTC CGACGCACTCAACCACGCGTCCATCATCGACGGCCTGCGCCACAGCCGCGCCCAGAAGCAGATCTTCCGCCACAACGACC CCGCTCACCTGGAAGAACTGATAGCGGCGGCCGACCCCGACGTCCCCAAGCTCATCGTCGCCGAGTCCGTGTACTCGATG AACGGCGACATCGCCCCGCTGTCCGAAATCGCCGACATCGCCAAGCGCCACGGGGCGATGACGTACCTCGACGAGGTGCA CGCGGTGGGCATGTACGGCCCGGAGGGTGCCGGCATCGCGGCCCGGGAGGGCATCGCCGACGACTTCACCGTCATCATGG GCACCTTGGCCAAGGGTTTCGGCACCACCGGCGGCTACATCGCAGGGCCCGCCGAAATCATCGAGGCGGTGCGCATGTTC TCCCGCTCCTTCGTCTTCACCACCGCGCTGGCGCCGGCCGTGGCCGCCGGCGCCCTGGCAGCCGTACACCATCTGCGGTC CTCCGAGGTCGAGCGGGAACAGCTCTGGTCGAACGCGCAGTTGATGCACCGGCTGCTGAACGAGCGTGGCATCCCCTTCA TTTCGGACCAGACGCACATCGTGTCCGTCATGGTGGGGGACGAGGCCGTGTGCAAGCGGATGTCCGCGCTGCTGCTCGAC CGGCACGGAATCTAGGTGCAGGCGATCAACGCGCCGAGCGTGCGGGTCGGTGAGGAGATCCTGCGGGTCGCCCCCGGAGC CGTGCACACCGCCGACGACGTACGCGAATTCGTCGACGCTCTGAGCCAGGTCTGGGAGGAAGTGGGCTCCGCCCGCGTGC CGGCGACCCCGGCCGCTCTCTGATCCGTCCACGTCAAGATGTGCGGGCCACGGCTACGCCGGCCAGATGTGCGGACTCCG GTTCTCGGGGAGGGCGGTGTGTCTTTGACGTGTCACGCACGTACGGCGGAAACGAACGGCGCTTTCCTCACGGACCATGG ACAGGGACCCTGCCCCACGGTCAGGACACGGACAATGTCGAAAGGCTGCCGGAAGGCTCGCAGGACATGCGCCTCGGCCG AAGACAAGTCCGGCCGGTCCCTCATACTCGACCCACAGGTCCCGGAACCCCGCGCATCCGGAGAACGGGCCGGACACCCG TGGAGTGCCCGGCCCGTCACGGTGCCGCGTACCTACGTGTCTGCTCGGAGAACGCCGTCTACCTCGCCATGGCCGTGCTC CTGGTGCGTCGCCTCACGAGACCAGTCCGCTGAGAGGCTTTTCAGACGCCCTCTAGGGTGCGGGCTCCCAGATCCTGACC GCACCGCAACCGCACGACTGGACCGAGATCGGCGGCCGGGTGCTCCTGGAGTGCTGAGCCTGCGCTCAGCCCATCTCCTC CAACGTCCTGCCCTTGGTCTCCGGCACCCACTTGAGGATGAACGGGACCGCCAGCGTGGCGAAGATCGCGTAGATCACGT ACGAACCGGACAGGTTCCACTCCGCCATGCTCGGGAACGTCGCGGTGACCAGCCAGTTGGCGACCCACTGCGCGCAGGCG GCGACGCCGAGCGCGGCCGCGCGGATGCGGCTGGGGAACATCTCGCCCAGCAGCACCCAGGCCGCCACGCCCAGCGACAT GGCGAAGAAGAGGACGAAGGCGTGGGCGGCGACCAGCGCGACGGTGGCCTGGGTGTCGGGCAGCGAGATGTCGTCACCCG TTCCGGTCTTGTAGGAGAAGGCCCAGGCGACGGCGGCGAGGGAGACCGCCATACCGGCGGAACCGGTCGCGGCCAGCGGC TTGCGGCCGACCCGGTCGATGAGCACCATCGCGATCACCGTGCCCACGATGTTGATGACCGAGGTGGTGAACGAGTAGAA GAACGAGCTCGACGGGTCGATGCCCACCGACTGCCACAGCGAGGAGCTGTAGTAGAAGATCACGTTGATACCGACGAACT GCTGGAAGACCGACAGGCCGACACCGACCCAGACGATCGGCAGCAGGCCGAAACGGCCGCGGAGGTCCTTGAACCGCGGT GCCTTGTCGCTGCGCGCGGCGTGCTCGATCTCGGCCACCCGGGCATCGAGATCGACCTGCGCGCCTTCGAGGGTGCGCAG CACCTCCTTGGCCTCGCCGGTCCTGCCGACTGAGACCAGGTAGCGCGGCGACTCCGGGATGCGCAACGCCAGCAGACCGT AGACCAGGGCGGGAACGGCCGCGATGCCGAGCATGACCTGCCACGCCTCCAGTCCGAGCAGGCTGCCGCGCTGGTCCCCG TCGGCGAGGGAGAGCACCATCCAGTTGACCAACTGGGAGACGGCGATGCCCAGCACGACGGCGGCCTGCTGGAAGGAGAC GAGCCGGCCGCGGTACTCGGTGGGCGCGACCTCGGCGATGTACGTGGGGCCGATCACGGAGGCCATGCCGATGGCCACGC CGCCCACGATGCGCCAGAAGGACAGGTCCCACGCCGTGAACGGCAGCATCGAGCCGATACTGCTGGCCAGGAAGAGCAGG GCGGCCAACTGCATCACCCGGAGGCGGCCGACGCGGTCGGCGAGCCGTCCCGCGAGCATGGCGCCGGCGGCCGCGCCGAG CAGGGCGATGGCGATGACGGCTCCGAGCGTGGCGGCGCCGACGTCGAACCGTCCCCTGATGCCCTCGACGGCGCCGTTGA TCACGGCGCTGTCGTAGCCGAAGAGGAAGCCGCCCATGGCGGCGGACGCCGCGATGAAGACGACGTGAGGCAGCTGGTTC GGCCGGGCCGCGAGGCCCCCGGCGGCGGGTTGTTGTGCTGTGCTCGTCACCTAAGGACTCCTGTGGTGATGTGTGTCGTT (SEQ ID NO: 2)

The clusters, and relative positions of the genes are shown in FIGS. 3A-3B. In general, the genes for chromophore biosynthesis appear to be in cluster 2; all other moe biosynthetic genes appear to be in cluster 1.

Constant rearrangements of chromosomal markers located in the ends of linear streptomycete chromosome is well documented (Redenbach 1993, Bentley 2002, Hopwood 2006). This may account for the duplication and divergence of the moe chromophore biosynthetic genes in S. ghanaensis, thus leading to the two-cluster organization. Such clusters are not unique. For example, the ansamitocin and clavam biosynthetic pathways from Actinosynnema pretiosum and S. clavuligerus, respectively, are also encoded by unlinked groups of genes (Yu 2002, Tahlan 2004). These findings suggest that a multi-clustered organization of secondary metabolism genes might be more common than it is anticipated.

A summary of the identified genes and their proposed function is presented below in Table 4. A detailed discussion of each gene and its function is presented in the Experimental Examples, Section III. In total, twenty-three open reading frames (ORFs) were found to be related to moe A biosynthesis (i.e. moe biosynthesis-related genes). The function of the encoded proteins was determined via bioinformatic and genetic analysis. The identified genes are sufficient for the biosynthesis of all four structurally different parts of moe; that is, the genes encoding the proteins necessary to form a structurally complete moe A molecule were identified.

TABLE 4 Deduced functions for genes in moe gene clusters 1 and 2 Amino ID %/ Acc. No. of Cluster ORF Acids Homologue SI % homologue Proposed function 1 MoeB5* 301 Putative acyl CoA ligase 58/76 AAX98210.1 Nonfunctional acyl (S. aizunensis) CoA ligase 1 Moe A5 394 As in case of MoeC4 64/78 AY240962 Nonfunctional Aminolevulinate synthase 1 MoeD5 638 Putative ABC transporter 41/55 YP075256.1 ABC transporter (Symbiobacterium thermophilum) 1 MoeJ5 564 As above 45/61 YP075255.1 ABC transporter 1 MoeE5 340 Putative UDP-glucose 4-epimerase 46/58 YP074610.1 NDP-hexose (Symbiobacterium thermophilum) 4-epimerase 1 MoeF5 645 WbpS 29/43 AAF24002.1 Unit F (Pseudomonas aeruginosa) Amidotransferase 1 MoeGT1 402 Putative glycosyltransferase 28/40 EAM38951.1 Glycosyltransferase (Polaromonas sp) (transfers unit F) 1 MoeH5 513 AsnB-like amidotransferase 32/48 CAI08539.1 Unit B (Azoarcus sp) Amidotransferase 1 MoeK5 407 Putative methyltransferase 34/52 NP142754.1 Methyltransferase (Pyrococcus horikoshii) 1 MoeGT4 427 Putative glycosyltransferase 27/38 EAS23724.1 Glycosyltransferase (Mycobacterium vahbaalenii) (transfers unit E) 1 MoeM5 530 GdmN 29/44 AAO06921.1 Carbamoyltransferase (Streptomyces hygroscopicus) 1 MoeN5 260 Putative prenyltransferase 30/58 NP220145 Prenyltransferase (Chlamydia trachomatis) 1 MoeO5 281 GGGPS 27/43 JC7965 Farnesyl-3- (Thermoplasma acidophilum) phosphoglycerate synthase 1 MoeX5 266 Putative membrane protein 26/40 EAS99725.1 ABC transporter (Mycobacterium sp) membrane protein 1 MoeP5 233 ABC transporter ATPase 43/58 EAS11435.1 ABC transporter (Mycobacterium flavescens) ATP-binding protein 1 MoeGT5 312 MoeGT4 (see above) 45/59 Glycosyltransferase (transfers unit C) 1 MoeGT2 286 Putative glycosyltransferase 35/51 AAU93096.1 Glycosyltransferase (Methylococcus capsulatus) (transfers unit B) 1 MoeGT3 414 Putative glycosyltransferase 44/56 ZP_00616987.1 Glycosyltransferase (Kineococcus radiotolerans) (transfers unit D) 1 MoeR5 374 CapD (Nocardioides sp) 53/68 EAO07657.1 Hexose-4,6-dehydratase 1 MoeS5 282 SCO7194 62/75 CAC01594.1 Hexose-4-ketoreductase (Streptomyces coelicolor) 2 Moe A4 516 Putative acyl CoA ligase 63/73 AAX98210.1 Acyl CoA ligase (Streptomyces aizunensis) 2 MoeB4 521 SimL 45/62 AAG34163.1 Amide synthetase (Streptomyces antibioticus) 2 MoeC4 412 HemA-AsuA 70/83 AY240962 Aminolevulinate (Streptomyces asukaensis) synthase

EXAMPLES

Genes involved in the synthesis of moe A were cloned and characterized from S. ghanaensis ATCC14672. This was followed by bioinformatic and genetic analysis of the identified moe sequences via a combination of gene disruption and heterologous expression approaches. Although not wishing to be bound by any theory, a likely moe A biosynthetic pathway has been elucidated (discussed below in section V). This pathway (see FIGS. 4A-4D) appears to explain the mechanism of phosphoglycerate incorporation into bacterial secondary metabolites. Furthermore, the pathway provides a basis to generate and identify bioactive derivatives and intermediates of moe A, which may have clinical use as peptidoglycan glycosyltransferase inhibitors.

This section is divided into five main parts. Part I describes exemplary materials and methods used in many of the examples that follow. It will be clear to those skilled in the art that in some aspects of the experimental examples, other methods, reaction conditions, protocols, etc. may be used with comparable results. Part II describes the cloning of moe A genes from S. ghanaensis ATCC14672, and Part III describes the bioinformatic and genetic analysis of each gene identified in Part II. Part IV describes the characterization of several moe A intermediates. Part V describes a theoretical, overall assembly scheme for moe A based on the information from Parts II and III. Finally, Part VI includes additional experimental examples to show the diversity and utility of the methods and compositions disclosed herein. All experimental examples, whether actual or prophetic, are presented to be instructive and not limiting.

I. Materials and Methods

A. Bacterial Strains and Vector DNAs

Moes producers S. ghanaensis ATCC14672 and S. bambergiensis NRRL-B12101 were obtained from American Type Culture Collection (“ATCC”) and the U.S. Department of Agriculture, respectively. S. lividans TK24, S. coelicolor M145 (Kieser 2000), S. cyanogenus S136 (Westrich 1999) were used in studies on moe A resistance in Streptomyces. Bacillus cereus ATCC19637 was used as a moe-sensitive test culture.

Escherichia coli NovaBlue (Novagen, San Diego, Calif.) was used as a general cloning host. E. coli XL1 Blue MR and cosmid SuperCos1 (Stratagene, La Jolla, Calif.) were used for generation of the S. ghanaensis genomic library. Methylation-deficient strain E. coli ET12567 carrying conjugative driver plasmid pUB307 (Flett 1997) was used for intergeneric E. coli-Streptomyces conjugations. E. coli BW25113 (pLU790) was from John Innes Centre (Norwich, UK). S. lividans J1725 (bidA mutant) and pIJ584 plasmid harboring intact bidA gene were donated by B. Leskiw (University of Alberta, Canada). Strains S. ghanaensis MO12, LH1, OB20a, OB21e with disrupted moeGT3, moeA4, moeM5, moeGT1 genes and S. lividans strains expressing various subsets of moe genes were constructed as described below.

Conjugative shuttle vector pKC1139 with temperature-sensitive pSG5 replicon (Muth 1989, Bierman 1992) was used for gene disruption and expression in S. ghanaensis. Vector pMKI9 is a derivative of pKC1139 with the ermE promoter inserted into a polylinker (provided by I. Ostash, Dept. of Genetics, Ivan Franko National University, L'viv, Ukraine). Vectors pKC1139, pSET152, pMKI9, pOOB40 are described in Ostash 2007. Integrative vector pSOK804 (Sekurova 2004) was from S. Zotchev (Norwegian University of Science and Technology, Trondheim, Norway). Expression vector pAF1 (ori^(pIJ101) bla tsr, PermE*, 6His tag) was provided by A. Bechthold (Freiburg University, Germany). Plasmids pKD4 and pCP20 (Datsenko 2000) were from J. Beckwith (Harvard Medical School, USA). Spectinomycin resistance cassette pHP45 was from J.-L. Pernodet (Université Paris-Sud, France). Apramycin resistance marker aac(3)IV in integrative conjugative vector pSET152 (Bierman 1992) was replaced with a spectinomycin resistance gene aadA to yield plasmid pOOB5. Plasmid pOOB40 carrying hygromycin resistance marker (“hyg”) was generated in the same way.

B. Media and Culture Conditions

LB and LA media were used for cultivation of E. coli strains and Bacillus cereus ATCC19637. For moe A production, S. ghanaensis was grown on solid YMA medium (yeast extract: 4 g/L; malt extract: 10 g/L; glucose: 4 g/L; agar: 18 g/L; pH prior to autoclaving was adjusted to pH 7.5) or in liquid medium (LM) described in Subramaniam-Neihaus (1997) or in mTSB (tryptic soy broth supplemented with 0.5 g MgCl₂ and 2.5 mL of trace elements solution (Kieser 2000) per 1 L of medium). For abundant sporulation, Streptomyces strains were grown on OM agar (Gromyko 2004). E. coli-S. ghanaensis conjugative mixtures were plated onto either MS agar (Kieser 2000) or OM agar supplemented with 10 mM MgCl₂. For chromosomal DNA isolation S. ghanaensis was grown in TSB. E. coli strains were cultivated at 37° C. B. cereus and Streptomyces strains were grown at 30° C. for 2-3 days unless otherwise stated in a description of specific procedures.

C. DNA Manipulations

Table 5 summarizes the primers used for PCR. Plasmid preparation from E. coli was carried out using Qiagen nucleic acid isolation kits according to the manufacturer's instructions (Qiagen, Valencia, Calif.). Total DNA from S. ghanaensis was isolated using a salting out method (procedure B; Kieser 2000). For genome sequencing, chromosomal DNA of S. ghanaensis was isolated from a strain passed through three 4-day rounds of growth at 40° C. (to obtain a strain free of endogenous plasmid pSG5) and additionally purified using Qiagen Genomic-tip 500/G (Qiagen, Valencia, Calif.). Ultracentrifugation experiments and in silico genome analysis showed that the total DNA submitted to the Broad Institute (Cambridge, Mass.) for genome sequencing did not contain pSG5 or other small (e.g., 10-50 kb) plasmids.

For recovery of shuttle E. coli-Streptomyces plasmids from S. ghanaensis, E. coli was transformed with the total DNA of recombinant, plasmid-containing S. ghanaensis strains and then selected for appropriate resistance markers. Plasmid DNA from E. coli clones was then isolated and mapped with restriction endonucleases to confirm their identity.

Restriction enzymes and other molecular biology reagents were obtained from commercial sources and used according to the manufacturer's instructions. DNA treatment with endonucleases, Klenow fragment, T4-polymerase, phosphatase and T4-ligase was performed using standard methods (e.g., Sambrook 1989). Southern analysis, digoxigenin labeling of DNA probes, hybridization and detection were performed according to the manufacturer's protocols (Roche, Alameda, Calif.). PCR was performed using KOD Hot Start DNA polymerase (EMD Biosciences, San Diego, Calif.) with addition of DMSO to reaction mixture (10% of final volume).

TABLE 5 Oligonucleotide Primers for PCR Analysis Name Primer Sequence (5′ to 3′) SEQ ID NO. ligup1HindIII AAAAAGCTTGACGACTTGGCCTTGGTGCTGT 49 ligrp1EcoRI AAAGAATTCCGTTTCAGTACCTTGCCGCTCG 50 CTcon73for AAAAAGCTTGACCGGGAACTCGCCGAG 51 CTcon73rev AAAGAATTCGTCGTAGGGAACGGCCCG 52 GTcon72for AAAAAGCTTGACCTGACACTCGTCGGCTTC 53 GTcon72rev AAAGAATTCTCGAGACGAGGAGCCCGTAC 54 GT2con73up AAAAAGCTTGTTCTGCGACGCGGACGAC 55 GT2con73rp AAAGAATTCAGGTTCGGAACGTGCAGCA 56 moeM5nEcoRXbaup AAGAATTCTAGATCGAGTGGGCTCCCTACTC 57 moeM5nEcoRIrp AAAGAATTCACCTGGGGGAGTGACCGAC′ 58 moeGT5up_P1 GCAGTGCGACGCGAGCGCACGAGCAGACGTCGTCATGT 59 GTAGGCTGGAGCTGCTTC moeGT5rp_P2 TCGGGGTGACCTCGTGTGTCAGCGCCCGGCGGCCGCTCC 60 ATATGAATATCCTCCTTAG moeGT2up_P1 CGAGGAGCCCGCCGCGGGAGCGGCCGCCGGGCGCTGAC 61 AGTGTAGGCTGGAGCTGCTTC moeGT2rp_P2 GCCGAGGTGCCGTCCACGCCGTTCCCCCTCCGTCGGCTA 62 CATATGAATATCCTCCTTAG moeGT2up_check ACGAGGGGGACTTCCGCAG 63 38start_KD4 CGTGCGCAGCGCGGTCTTCGGCTTCGACGGGGTACGGAT 64 GAATATCCTCCTTAGTTC moeA5_P3 AGACGCGCCGGGCGGCCCCCAGTTCGGACCAGATGCCG 65 TAGGCTGGAGCTGCTTCG P2_KD4 CATATGAATATCCTCCTTAGTTC 66 alsrev1 AAATCTAGATCAAGAGCGGCCGGGGTC 67 moeF5up_P3 CGGCTCCTCGGTGTCCGTGCCGCGGCTGTAGGCGGCATG 68 TAGGCTGGAGCTGCTTC moeF5rp_P2 TGGACGAGCGGTCGGTCGGGGGCAGCCATGGGTCTCCT 69 ACATATGAATCTCCTTAGTTC F5check_up GTCTCGGTCAACGAAGTGGTC 70 F5_check_rp CTCTCCAGGGAGATGGTCCG 71 moeGT4up_P3 TGCACAGCCTGTACCGGTCGACCTCCAACACCGACCGTG 72 TAGGCTGGAGCTGCTT moeGT4rp_P2 TCAGCTCTCCTGACGCGTGGGTGAGGACGACGGAGTGA 73 GCATATGAATCTCCTTAGTTC moeK5-P1 TCCAGAAGCGGGCCGGCGTGCTGCCGCACCTCGGGGCT 74 GTAGGCTGGAGCTGCTTCG moeK5-P2 TGTGCAGGCCGTCCAGCGTGTTGCGCCACTGGCCGGTCA 75 TATGAATATCCTCCTTAG GT2con72up AAAAAGCTTGTTCTGCGACGCGGACGAC 76 GT2con73rp AAAGAATTCAGGTTCGGAACGTGCAGCA 77 moeH5up_P1 AGGCCGCCCTCCAGCCCCTGCTGGACGCCCGATGACGGT 78 GTAGGCTGGAGCTGCTTC moeH5rp_P2 TCTCGTGAAGTGGGGGTCTGCGGCGGTCCGGCCCCGCTA 79 CATATGAATATCCTCCTTAG moeN5up_P1 CCGGCCACGGCCCTGCCGGCGGACTACACGGAGACCAT 80 GTAGGCTGGAGCTGCTTCG moeN5rp_P2 GGACGGCCGGCCGGAGACGCTCCGGCCGGCCGTCGGTC 81 ATATGAATATCCTCCTTAG moeGT3intMfeI AAACAATTGTTCTGCGACGCGGACGAC 82 orf1intXbaI AAATCTAGAGGACTCTGCACCCTGAC 83 moeR5XbaIup AAATCTAGACGCGATGAACCGTCACG 84 moeGT3XbaIup AAATCTAGACGTGCCCTTCGACGACCCG 85 moeGT3EcoRIrp AAAGAATTCCCACGCCCTGGTCCTGGAC 86 moeN5XbaIup AAATCTAGACAGGTCACCGAGTACCTCGA 87 moeN5EcoRIrp AAAGAATTCCGCTGATCAACACGTCGCTC 88 moeF5XbaIup AAATCTAGACACCCAGATCGAGTGGACC 89 moeF5EcoRIrp AAAGAATTCATGGGTCTCCTAGGAGAG 90 moeGT4XbaIup AAATCTAGAGTACCGCTCCTTCTTCATGC 91 moeGT4EcoRIrp AAAGAATTCAGTGGAGCGACAGTACCTTC 92 moeH5XbaIup AAATCTAGACTGGACCAGGACGCGGTG 93 moeH5EcoRIrp AAAGAATTCGCTGATGTCTCGTGAAGTGG 94 moeGT5XbaIup AAATCTAGAGGGACCGGACTCGGACGT 95 moeGT5EcoRIrp AAAGAATTCGGTGACCTCGTGTGTCAGC 96 moeGT2XbaIup AAATCTAGAAGGGCCTGCACTTCACCT 97 moeGT2EcoRIrp AAAGAATTCGCCGTCCGGATCGACCA 98 moeK5XbaIup AAATCTAGATCCAGCGTGTTGCGC 99 moeK5EcoRIrp AAAGAATTCACGAGACATCAGCCG 100 moeO5HinDIIIup AAAAAGCTTCGGGGCGTGCCTTCTTC 101 moeO5XbaIrp AAATCTAGACCGCCCGCTCCCCGGAC 102 moe2HindIII-up AAAAAGCTTGACGTGAGGCCGTGCAGTTC 103 moe2MfeI-rp AAACAATTGGCACATCTTGACGTGGACGG 104

The plasmid and cosmid libraries for S. ghanaensis ATCC14672 genome sequencing were created at Broad Institute (Cambridge, Mass.). The cosmid library used for the retrieval of moe clusters 1 and 2 was constructed using the SuperCosI Vector system (Stratagene, La Jolla, Calif.) according to manufacturer's instructions. Sequencing of cosmids moeno5, 38, 40 and their subclones was done at Biopolymers Facility of Harvard Medical School using standard (M13, T4, T7, T3) and custom designed primers.

D. DNA and Protein Sequence Analysis

The generation, assembly and analysis of S. ghanaensis genomic sequences will be described separately. Briefly, the draft assembly yielded 1018 contigs containing 7.4 Mbp of S. ghanaensis genome (at 6.6× coverage) and about 1.2 Mbp are estimated to lie in the gap. BLAST search tools (on the server of the National Center for Biotechnology Information, Bethesda, Md.), FramePlot2.3.2 (Ishikawa 1999), CUPplot1.0 and Lasergene software package were used for S. ghanaensis sequences assembly, analysis and annotation. Homologues of moe gene translation products were found through BLASTP. Pair-wise amino acid sequence alignment was performed using the sequence analysis program on the server of European Bioinformatics Institute (Cambridge, UK). CDD search engine (BLAST server) and a set of programs (HHPred, Pfam, TMHHM) on ExPaSy proteomics server were utilized for identification of topology and conserved domains of the moe proteins.

E. Identification and Cloning of Moe Gene Clusters 1 and 2

Using BLASTX, all contigs provided by the Broad Institute were scanned in silico for the presence of clustered genes for glycosyltransferases, sugar tailoring genes and genes involved in isoprene metabolism. Seventy contigs containing at least some of the expected genes were then analyzed in more details using FramePlot and BLASTP programs. One stand-alone contig, contig 908, and three adjacent contigs 71, 72, 73 were identified as most likely carrying all or most of the genes involved in moe biosynthesis. On the basis of contig 908, sequence primers ligup1HindIII and ligrp1EcoRI were designed to amplify 1039 base-pair internal fragment of the moe A4 gene (which spans the moe A4 coding region from amino acid 160 to 506) from the S. ghanaensis ATCC14672 genome. Primers GTcon72for and GTcon72rev (designed base on the contig 72 sequence) were used to clone a 424 base-pair internal fragment of the moeGT1 gene (amino acids 164-305). A 489 base-pair fragment of the moeGT3 gene (amino acids 228-390) was amplified with primers GT2con73up and GT2con73rp (designed based on contig 73 sequence). DIG-labeled fragments of the moeA4, moeGT1, and moeGT3 genes were used to probe a S. ghanaensis cosmid library. Positive cosmids moeno38 and moeno40 were found to carry overlapping segments of S. ghanaensis genome that cover contigs 71, 72, 73. Cosmid moeno5 was found to cover contig 908. The aforementioned cosmids were used to finish sequencing moe clusters 1 and 2 (e.g., fill gaps of poor sequence resolution). One cosmid, moeno5, carried 3 moe biosynthetic genes, moe A4, moeB4, and moeC4 (moe cluster 2); the other two cosmids carried the rest of the identified moe genes (moe cluster 1). (See, e.g., FIGS. 3A-3B).

F. DNA Introduction into E. coli and Streptomyces Strains

Introduction of plasmids and cosmid library sequences into E. coli was done as described in Sambrook 1989. A slightly modified procedure of Streptomyces-E. coli conjugation (Kieser 2000) was employed to introduce plasmids into S. ghanaensis strains. Particularly, heat shocked ungerminated spores were used for matings, and conjugation mixtures were overlaid with selective antibiotics after 10 hours of growth. Conjugations with S. ghanaensis disruption mutants, aimed at obtaining complemented strains, were performed at 37° C. and overlaid after 7-8 hours of growth to avoid the excision of disruption plasmid from the mutated moe gene. The average frequency of appearance of S. ghanaensis pKC1139+ transconjugants was 6.6×10⁻⁵. Plasmid pSET152 was transferred into S. ghanaensis ATCC14672 at frequency 1.0×10⁻³. There was one attB^(φC31) site in the S. ghanaensis chromosome (as judged from Southern analysis of the transconjugants, see e.g., FIGS. 5A-5B). Also, free copies of pSET152 exist in S. ghanaensis cells as evident from Southern analysis and plasmid DNA analysis of Amr clones of E. coli obtained after transformation with total DNA of pSET152+ transconjugants. S. ghanaensis pSETIS2+ and pKC1139+ transconjugants did not differ from wild type in their ability to grow, sporulate and produce moe A. The introduction of plasmid and cosmid DNA into S. lividans was carried out according to published procedures described in Kieser 2000.

G. Construction of Plasmids for Moe Gene Disruptions and Expression

Internal fragments of moe A4 and moeGT1 genes used for screening the S. ghanaensis cosmid library were cloned as HindIII-EcoRI fragments into the HindII and EcoRI sites of pKC1139 to yield pKC1139lig3 and pOOB21e, respectively. An EcoRV fragment carrying hygromycin resistance cassette hyg (Kieser 2000) was excised from pHYG1 (Zhu 2005) and inserted into blunt-ended BamHI site of pKC1139lig3. In this way plasmid pLH1 was generated with internal moe A4 fragment being divided into two “arms” of 1 and 0.1 kb in length.

A 462 base pair internal fragment of moeM5 gene (corresponding to amino acid region 214-356 of the moeM5 protein) was amplified from cosmid moeno38 with primers CTcon73for and CTcon73rev. The PCR product was then digested with HindIII and EcoRI and cloned into the corresponding sites of conjugative E. coli-Streptomyces vector pKC1139 to yield plasmid pOOB20a. E. coli ET12567 (pUB307) was transformed with pOOB20a and the resulting strain was used as a donor in E. coli ET12567 (pUB307, pOOB20a)—S. ghanaensis ATCC14672 intergeneric conjugation. In this way, plasmid pOOB20a was transferred into S. ghanaensis. Under permissive conditions (growth at 30° C.) pKC1139-based plasmids replicate in Streptomyces hosts, but at temperatures higher then 34° C., these plasmids are either eliminated from the cells or forced to integrate into host's genome via homologous recombination (Muth 1989).

A 5 kb EcoRI fragment carrying 3′-truncated moeC4 gene and entire moeB4 genes was retrieved from cosmid moeno5 and cloned into EcoRI site of pOOB5 resulting in pKC11395-8 plasmid.

Gene moeGT1 along with its putative ribosome binding site (“RBS”) was amplified from cosmid moeno38 with primers moeGT1XbaIup and moeGT1EcoRIrp, treated with XbaI and EcoRI and cloned into XbaI-EcoRI digested pMKI9 in order to fuse moeGT1 with ermE promoter. From this intermediate construct (named pOOB32) PermE-moeGT1 was excised as a HindIII-EcoRI fragment, treated with T4 DNA polymerase and cloned into EcoRV site of pOOB40 to give pOOB41c.

Gene moeM5 along with its RBS was amplified from cosmid moeno38 with primers moeM5nEcoRXbaup and moeM5nEcoRIrp. The final pOOB40-based construct pOOB43a carrying PermE-moeM5 was generated in a two-step manner, similar to the construction of pOOB41c.

Genes moeD5 and J5 along with their putative promoter region were cloned from cosmid moeno38 with primers con71end and con72start, treated with XbaI and EcoRI and inserted into pMKI9 to yield pOOB38.

Additionally, gene moeM5 along with its ribosomal binding site was amplified from cosmid moeno38 with primers moeM5nEcoRXbaup and moeM5nEcoRIrp, treated with XbaI and EcoRI and cloned into XbaI-EcoRI digested pMKI9 (pKC1139 derivative with strong constitutive Streptomyces ermE promoter) in order to fuse moeM5 with ermE promoter. From this intermediate construct (named pOOB42) PermE-moeM5 was excised as a HindIII-EcoRI fragment, treated with T4 DNA polymerase and cloned into EcoRV site of pOOB40 (actinophage φC31-base integrative E. coli-Streptomyces vector) to give pOOB43a. This plasmid was introduced into S. ghanaensis OB20a strain for pOOB20a introduction into S. ghanaensis ATCC14672. The introduction of an intact copy of the moeM5 gene into S. ghanaensis OB20a strain was performed to demonstrate the restoration of moe A production in the mutant.

Genes moeF5, moeH5, moeGT4, moeGT5, moeGT2, moeGT3, moeN5, moeK5 were amplified via PCR using cosmid moeno38-1 as a template and respective primers listed in Table 4. Restriction sites for endonucleases XbaI and EcoRI were engineered into the primers to facilitate the cloning of the moe genes into XbaI-EcoRI digested vector pMKI9. The following plasmids were constructed: pOOB48a (moeF5), pOOB51 (moeH5), pOOB50 (moeGT4), pOOB52 (moeGT5), pOOB56c (moeGT2), pMOl13 (moeGT3), pMO17 (moeN5), pKC1139EmoeK5 (moeK5). Genes moeO5moeN5 plus moeO5-moeX5 intergenic region were amplified using primers moeO5HindIII and moeN5EcoRI. The amplicon was digested with restriction endonucleases HindIII and EcoRI and cloned into respective sites of pSOK804 yielding pOOB63a. Gene moeO5 was amplified with primers moeO5HindIII and moeO5XbaIrp and cloned into HindIII-XbaI-digested vector pAF1 to give pMoeO5extra. Plasmids pOOB63a and pMoeO5extra were used for complementation of ΔmoeN5 strain. Genes moeA4moeB4moeC4 (moe cluster 2) were amplified with primers moe2HindIII and moe2MfeI. The resulting PCR product was treated with HindIII and MfeI and cloned into HindIII-EcoRI digested pSOK804 thus giving pOOB64b. A 1.5 kb HindIII-EcoRI fragment containing moeGT3 fused to PermE was excised from pMO13, treated with Klenow enzyme and cloned into EcoRV site of pOOB40 (Hy) to give pMO14. This plasmid was used to complement moeGT3-deficient S. ghanaensis MO12 strain and to construct plasmid pOOB58 (see next chapter). Gene moeB4 has been subcloned from pOOB12 (Ostash 2007) as a XbaI-EcoRI fragment into respective sites of pMKI9, giving pOOB46e. This plasmid was coexpressed with various moeno38-1 derivatives to study the chromophore (unit A) biosynthesis. The fragment of moe cluster 1 containing moeR5moeS5 genes and putative moeS5 promoter (PmoeS5) was amplified with primers moeGT3intMfeI and orf1intXbaI, treated with MfeI and XbaI and cloned into XbaI-EcoRI digested pMKI9 to yield pOOB49f. PmoeS5-moeS5 fragment was retrieved from pOOB49f via XbaI-EcoRI digestion and cloned into pMKI9 to give pOOB55. Gene moeR5 was amplified from moeno38-1 with primers moeR5XbaIup and moeGT3intMfeI and cloned into XbaI-EcoRI digested pMKI9 to yield pOOB59. Plasmids pOOB49f, pOOB55, pOOB59 were coexpressed with the rest of moe cluster 1 to study the roles of moeR5 and moeS5 in moe A biosynthesis.

The internal fragment of moeGT3 was amplified with primers GT2con73up and GT2con73rp, digested with HindIII and EcoRI and cloned into respective sites of pKC1139. The resulting plasmid was named pMO12 and used to insertionally inactivate moeGT3 gene within S. ghanaensis ATCC14672 chromosome following the described protocol (Ostash 2007).

There is unique XhoI site in plasmid pMO14 located 205 bp downstream of moeGT3 start codon. Plasmid pMO14 was digested with XhoI, treated with Klenow fragment and ligated to spectinomycin resistance gene aadA (retrieved as DraI fragment from pHP45). The resulting plasmid pOOB58 was used as a source of 3.3 kb XbaI-EcoRI linear moeGT3::aadA fragment to replace the intact moeGT3 in cosmid moeno38-91 (derivative of moeno38-1 with deleted moeGT5; see below) via A-RED approach.

H. Generation of S. ghanaenss and S. lividans Disruption Mutants and their Analysis

The same procedure was applied for all four gene knockouts described below in Section III (moeM5, moe A4, moeGT1 and moeGT3; see e.g., FIG. 6 for an exemplary schematic). Strains carrying pKC1139-based disruption plasmids in replicative form were grown for 3 days in TSB at 30° C. (e.g., strain S. ghanaensis carrying moeM5 disruption plasmid pOOB20a in replicative form). The biomass was then washed three times with water to remove apramycin used for plasmid selection, and approximately 10⁵ colony forming units (“cfu”) were inoculated into fresh TSB (25 mL) without antibiotic. The culture was incubated for 6 days at 40° C. (to eliminate free plasmid), plated onto YMA supplemented with apramycin and grown for 4-5 days at 37° C.

In this way colonies with disruption plasmids integrated into genes of interest—via recombination between regions of homology in the chromosome (full copy of gene of interest) and on the plasmid (the internal fragment of the gene)—were obtained. This integration disrupted the coding region of the gene, leading to deficient strains of S. ghanaensis.

Ten independent colonies for each gene disruption experiment were assayed for moe A production, and in no case was the moe A+ phenotype detected due to possible non-specific integration of plasmid into S. ghanaensis genome. Additionally, the reversions to moe A+ phenotype was not detected when the strains were grown in presence of apramycin at 37° C. indicating that under the stated conditions the insertional inactivation mutants are stable. Passage of wild-type strains under cultivation conditions used to generate moe mutants did not negatively affect moe A production.

The site-specific integration of the disruption sequences was also confirmed by Southern analysis. For moeM5 confirmation, a moeM5 fragment (either radioactively or non-radioactively labeled) was used as a probe. A 2.8 kb XhoI fragment of wild-type digest hybridized with the moeM5 probe, whereas there were two different hybridizing bands in case of moeM5 mutant total DNA XhoI digest. This corresponded to integration of pOOB20a plasmid into the moeM5 gene and introduction of an additional XhoI site into this chromosomal region (See FIGS. 9A-9B).

For moe A4 plasmid pLH1 in S. ghanaensis LH1 strain was confirmed by Southern analysis using DIG-labeled moe A internal fragment as a probe. In wild-type strains, the moe A4 gene resides in a 10 kb BamHI fragment, whereas in the LH strain, the corresponding hybridizing band is absent and a new 19 kb band was present. The latter corresponded to integration of 9 kb pLH1 plasmid into 10 kb BamHI moe A-containing fragment of S. ghanaensis chromosome.

Likewise the integration of the 7 kb moeGT1 disruption plasmid pOOB21e into 10 kb moeGT1-containing BamHI fragment of S. ghanaensis genome was demonstrated (See FIGS. 10A-10B). For moeGT3 disruption, the plasmid pm012 was transferred into S. ghanaensis via conjugation and homologous integration. The integration was verified by Southern analysis.

Derivatives of moeno38-1 carrying the deletions of moe genes were generated via λ-RED approach. The following procedure was used for all λ-RED-assisted deletions of moe genes within cosmid moeno38-1 (except for moeGT3). Briefly, the entire open reading frame(s) was replaced with kanamycin resistance cassette (pKD4). Then the mutated cosmid was introduced into strain DH5a (pCP20) to evict kanR as described (Datsenko 2000). The presence of expected deletions within the cosmids was checked by PCR. λ-RED recombination was used to replace moeGT3 with disrupted allele moeGT3::aadA in ΔmoeGT5 derivative of moeno38-1.

For moeno38-5 (deletion of moeA5moeB5 genes), the kanamycin resistance gene from plasmid pKD4 was amplified with primers 38start-KD4 and moeA4-P3. The resulting amplicon was used to replace moeA4moeB5 gene pair as well as the entire nonessential “left arm” of moeno38-1 (FIGS. 3A-3B). Our previous studies showed that deletion of this arm did not alter moe A production (data not shown). We did not evict kanR gene region from moeno38-5 because it did not exert any negative effects on moe A production. The replacement of moeA5moeB5 genes with kanR in moeno38-5 was confirmed via diagnostic PCR (primers P2-KD4 and alsrev1).

For moeno38-91 (deletion of moeGT5 gene), gene kanR was amplified with primers moeGT5up-P1 and moeGT5rp-P2. This amplicon was used to replace moeGT5 gene. The resulting cosmid moeno38-9 was introduced into E. coli DH5a (pCP20) to excise the kanR in FLP-mediated reaction (Datsenko 2000). The cosmid carrying 81 bp “scar” sequence instead of moeGT5 was named moeno38-91. Deletion of moeGT5 in moeno38-91 was confirmed via PCR (primers moeGT5XbaIup and moeGT5EcoRIrp).

For moeno38-31 (deletion of moeGT4), the cosmid moeno38-31 was constructed in the same way moeno38-91 was. The deletion of moeGT4 was checked by PCR (primers moeGT4XbaIup and moeGT4EcoRIrp).

For moeno38-81 (deletion of moeGT2), the cosmid moeno38-81 was constructed in the same way moeno38-91 was. Deletion of moeGT2 was confirmed via PCR (primers moeGT2XbaIup and moeGT2EcoRIrp).

For moeno38-911 (deletion of moeGT5 and disruption of moeGT3), a 3 kb XbaI-EcoRI fragment containing moeGT3::aadA allele was retrieved from pOOB58 and used to replace moeGT3 in cosmid moeno38-91. The replacement of moeGT3 with moeGT3::aadA in moeno38-911 was verified by PCR (primers moeGT3XbaIup and moeGT3EcoRIrp).

For moeno38-41 (deletion of moeF5), the cosmid moeno38-41 was constructed in the same way moeno38-91 was. The deletion of moeF5 was checked by PCR (primers moeF5check-up and moeF5check-rp).

For moeno38-61 (deletion of moeH5), the cosmid moeno38-61 was constructed in the same way moeno38-91 was. The deletion of moeH5 was checked by PCR (primers moeH5XbaIup and moeH5EcoRIrp).

For moeno38-21 (deletion of moeK5), the cosmid moeno38-21 was constructed in the same way moeno38-91 was. The deletion of moeK5 was checked by PCR (primers moeK5XbaIup and moeK5EcoRIrp).

For moeno38-7 (deletion of moeN5), the gene moeN5 was replaced with the kanR cassette as described above for moeno38-5 construction. We did not excise the kanR cassette from moeno38-7 because it did not exert any polar effects on moe A production.

Gene moeGT3 was insertionally inactivated in S. ghanaensis genome according to established procedure (Ostash 2007). All constructs carrying moe genes were transferred into S. lividans via intergeneric conjugation. Plasmids pUJ584 and pMoeO5extra were introduced via protoplast transformation. Integration of moeno38-1 and its derivatives into S. lividans genome was checked as described in Ostash 2007.

I. Chemicals

Organic solvents, salts, sugars, ITPG, X-Gal and antibiotics were purchased from standard commercial suppliers. The purified samples of moe A have been kindly provided by J. Taylor and S. Fuse (Dept. of Chemistry and Chemical Biology, Harvard University). For recombinant strains selection following commercially available antibiotics were used (mg/mL): ampicillin (100), chloramphenicol (35), kanamycin (50), apramycin (50), hygromycin (100), spectinomycin (200), streptomycin (100), thiostrepton (50), nalidixic acid (50).

J. Moe Production and Resistance Analysis

1. Moe Production in S. ghanaensis

For all moe production analysis procedures, equal amounts of biomass (wet weight) and fermentation medium were used. For moes production, S. ghanaensis strains with disrupted moe genes were grown at 37° C. for 4-5 days in mTSB and for 10 days in LM. For antibiotic disc diffusion assays, fermentation medium and concentrated methanol extracts of moe A from mycelium of S. ghanaensis strains were applied to antibiotic assay discs (diam. 10 mm, Sigma, St. Louis, Mo.) and stacked onto LA plates overlaid with soft agar containing B. cereus. Semipurified samples of moe A and its derivatives were obtained by methanol extraction of mycelium of S. ghanaensis strains and further C18 solid phase extraction as described in (Eichhorn 2005) and then used for LC-MS analysis (Eichhorn 2005) and biochromatography. For the latter, dried silica gel aluminum TLC plate (mobile phase—methanol:acetonitrile:water 40:40:20) with separated moes were overlaid with soft agar containing B. cereus, incubated overnight at 30° C. and then visualized with UV light (254 nm).

For moeM5, S. ghanaensis OB20a was incubated in TSB medium supplemented with apramycin (to select for pOOB20a integration in moeM5) for 4 days at 37° C. and the moes were extracted from mycelium with methanol. The methanol extract was evaporated, and dry residue was dissolved in water and analyzed as noted above, by antibiotic disc diffusion assay, biochromatography and LC-MS.

2. Moe Production in lividans

Heterologous expression of moe biosynthetic genes in S. lividans TK24 leads to the production of moe derivatives and intermediates. Small-scale fermentation and purification of moenomycins was performed according to Ostash 2007. To obtain pure (>90 as judged by TLC) moenomycin intermediates from recombinant S. lividans strains, the following procedure was used. TSB medium (30 mL in 250 mL flask containing 70 glass beads (Ø 5 mm)) was inoculated with 100 μL (approx. 10⁴-10⁵ cfu) of stock culture (kept in 10.3% sucrose at −20° C.). The flask was incubated on orbital shaker (240 rpm) for 2 days at 37° C. and then used as a preculture to start the fermentation. R5 medium (Kieser 2000) in a slightly modified form (sucrose: 6% instead of 10.3%; 1 mg/L CoCl₂ was added after autoclaving) was used as a fermentation medium. 8 4 L flasks (500 mL of medium per each one) containing beads were grown for 6 days at 37° C. The mycelium was harvested by centrifugation and extracted exhaustively with methanol-water (9:1) at 37° C. (when necessary, the pH of extraction mixture was adjusted to 7-7.5 with Tris-HCl). The extract was concentrated in rotovapor, reconstituted in water and extracted with dichloromethane. Aqueous phase was loaded on XAD-16 column (30×400 mm), washed with water (300 mL) and eluted with methanol (500 mL). Methanol fractions containing the desired compound were combined, concentrated and purified on Sep-Pak C₁₈ SPE cartridge (Waters) as described (Eihchorn 2005). Further silica gel flash chromatography or preparative TLC of the obtained extract according to (Adachi 2006) yielded pure compound 0.1-0.4 mg/4 L, depending on strain. Antibiotic disc diffusion assay, LC-MS, MS/MS and determination of accurate mass spectra of moenomycins were carried out as described in (Ostash 2007). ¹H NMR spectra of compound 11 (FIGS. 4A-4D) were recorded on a Varian Inova 500 (500 MHz) instrument in D₂O (4.80 ppm). Chemical shifts are reported in parts per million (ppm) units.

100 μL (approx. 2×10⁶ cfu) of 48 hour liquid cultures of Streptomyces strains were mixed with 4 mL of soft agar and spread on YMA plates. Then 5 mm antibiotic assay discs with different amounts of moe A were placed on top of soft agar. The moe A growth inhibition was monitored after 12, 24, 48, 72 and 96 hours of cultivation at 30° C.

II. Cloning of Moe Biosynthetic Genes from S. ghanaeasis ATCC14672

“Reverse genetics” strategies (Weber 2003) became popular for identification of antibiotic biosynthesis gene clusters which share conserved motifs such as the polyketide synthase or nonribosomal peptide synthase genes. Moe A, too, contains structural elements for which dedicated biosynthetic enzyme activity may be ascribed (See e.g., FIG. 7). Using degenerate primers homologous to conserved regions of aforementioned genes (Decker 1996, Rascher 2003, Kawasaki 2003), a set of DNA fragments encoding candidate moe genes was amplified. However, disruption of these cloned genes in the S. ghanaensis genome showed that none of them is involved in moe production.

As PCR-based approaches did not lead to moe biosynthetic genes, an in silico, whole-genome scanning strategy was used. The genome of S. ghanaensis ATCC14672 (approximately 8.6Mbp) was shot-gun sequenced to 6.6× coverage and partially assembled yielding 1018 contigs ranging from 1 to 95 Kbp in size. (this phase of the investigation was performed in collaboration with Broad Institute; see the trace sequences at “Traces” on the NCBI website). The structure of moe A suggests that clustered glycosyltransferase, sugar production and prenyltransferase genes as well as unknown genes for chromophore and phosphoglycerate unit incorporation would be identified (FIG. 7). Using BLASTX, all contigs provided by Broad Institute were scanned in silico for the presence of such genes and gene clusters. 70 contigs containing at least some of the expected genes were then analyzed in more detail using FramePlot and BLASTP programs. One stand-alone contig, 908, and three adjacent contigs, 71, 72 and 73 were identified as those most probably carrying all or most of the genes required for moe biosynthesis. These contigs were assigned to two different chromosomal locations or clusters (cluster 1 and cluster 2).

On the basis of the contig 908 sequence, primers ligup1HindIII and ligrp1EcoRI were designed to amplify 1039 bp internal fragment of the moe A4 gene (which spans moe A4 coding region from 160aa to 506aa) from S. ghanaensis ATCC14672 genome. Primers GTcon72for and GTcon72rev (designed on the basis of contig 72 sequence) were used to clone a 424 bp internal fragment of moeGT1 gene (164-305aa). A 489 bp fragment of moeGT3 gene (228-390aa) was amplified with primers GT2con73up and GT2con73rp (designed on basis of contig 73 sequence). For primer sequences, see Table 5. DIG-labeled fragments of moe A4, moeGT1, moeGT3 genes were used to probe a S. ghanaensis cosmid library. Positive cosmids moeno38 and moeno40 were found to carry overlapping segments of the S. ghanaensis genome that covered contigs 71, 72, 73; cosmid moeno5 was found to cover contig 908. One cosmid, moeno5, carried 3 moe biosynthetic genes, moe A4, moeB4, and moeC4 (moe cluster 2); the other two cosmids carried the rest of the moe genes (moe cluster 1), see FIGS. 3A-3B.

In total, 43 Kb of the S. ghanaensis chromosome has been sequenced, and 30 open reading frames have been identified. The sequences of cluster 1 and cluster 2 are shown in Table 1 and Table 2, respectively. The open reading frames were found to contain the typical (for Streptomyces) GC bias in the third codon position of around 90%, and typical codon usage. On the basis of homology searches and functional analysis (see Bioinformatics and Genetic Analysis, Part III, below) 23 of the open reading frames were identified as likely to participate in moe biosynthesis (see Table 4 for predicted function).

III. Bioinformatics and Genetic Analysis

The function of each gene and the role it is likely to play in moe synthesis was determined based on both bioinformatics study and genetic analysis. An NCBI Database BLAST search of the protein sequence was performed. The closest homologue alignment, as determined by this search, is presented in the Tables associated with each open reading frame. In the alignment tables, “QUERY” indicates the moe protein sequence and “SUBJECT” indicates the homologue protein sequence. SeQ ID NOS corresponding to each contiguous stretch of amino acids are indicated in the table of above the respective sequence. The genes identified as likely to be material to moe biosyntheses are described below in seven groups (A-G) based on their function: (A) genes for 2-amino-3-hydroxycyclopent-2-enone moiety (CsN unit) biosynthesis and attachment to pentasaccharide moiety of Moe A; (B) glycosyltransferase genes; (C) sugar tailoring genes; (D) genes for phosphoglycerate-lipid moiety biosynthesis; (E) transport genes; (F) genes flanking moe clusters 1 and 2; (G) regulatory genes.

We also used a genetic approach to decipher the moe A biosynthetic pathway. The major moe cluster 1 minus the moeR5moeS5 genes is located on the hygromycin resistant cosmid moeno38-1 (FIG. 3), which directs the production of precursor 19 in S. lividans TK24 (FIGS. 4A-4D) (Ostash 2007). We also constructed a set of moeno38-1 derivatives carrying λ-Red-induced deletions (Datsenko 2000, Gust 2003) of individual moe genes; one double mutant cosmid (ΔmoeGT5ΔmoeGT3) was also created. Gene moeGT3 was disrupted in the moe A producer S. ghanaensis ATCC14672 (strain MO12) as well. Genes moeR5moeS5 are located within the pKCl 139-based plasmid pOOB49f (FIG. 2). Plasmid pOOB64b (based on vector pSOK804 (Sekurova 2004) carries moe cluster 2, an apramycin resistance marker and an actinophage VWB attP-int fragment. Derivatives of moeno38-1 were integrated into the S. lividans attP^(φC31) site and then certain strains were further supplemented with either pOOB49f or pOOB64b, or their truncated versions. The mutations in individual moe genes were complemented with exact copies of the genes, thus ruling out any polar effects. All recombinant S. lividans strains were analyzed following purification by a set of spectroscopic methods and bioassays, which guided our prediction of the structures of moe A derivatives. We abbreviate the names of recombinant S. lividans strains. For example, S. lividans strain carrying moeno38-1 derivative with deleted moeF5 is referred to as ΔmoeF5; expression of moeR5 in ΔmoeH5 strain is marked as moeR5⁺ΔmoeH5; strains carrying the parental cosmid moeno38-1 are marked as 38-1⁺ strains.

A. Genes for 2-Amino-3-Hydroxycyclopent-2-Enone Moiety (C₅N Unit) Biosynthesis and Attachment to Pentasaccharide Moiety of Moe A

Five different genes were identified that fit this functional category; two in cluster 1 (moe A5 and moeB5) and 3 in cluster 2 (moe A4, moeB4 and moeC4). Both moe clusters 1 and 2 carry a copy of putative aminolevulinate synthase gene (moeC4 and moeA5; FIGS. 3A-3B), which is proposed to direct the production of 5-aminolevulinic acid, the putative precursor to the proposed aminocyclopentadione A ring (Ostash 2007).

1. moe A5 (Cluster 1)

The biosynthetic studies led by Floss, Welzel and Felsberg showed that 5-aminolevulinic acid (5-ALA) is a linear precursor of the C₅N chromophore (Nakagawa 1985, Schuricht 2000, Petricek 2006). In moe cluster 1, the moeA5 gene, which displays end-to-end homology to known and putative 5-ALA synthases from various bacteria, was identified. The nucleotide and polypeptide sequences are shown in Tables 6 and 7, respectively.

TABLE 6 DNA Sequence of moeA5. ATGGACTTCTTCGTGCGACTCGCCCGCGAAACCGGTGACCGGAAGAGGGA GTTTCTCGAACTCGGCCGCAAGGCGGGTCGGTTCCCCGCGGCGAGCACCT CGAATGGCGAGATTTCCATCTGGTGCAGCAACGACTACCTGGGTATGGGG CAGCACCCGGACGTCCTCGACGCCATGAAGCGCTCCGTGGACGAATACGG CGGAGGATCCGGGGGTTCGCGGAACACAGGCGGAACCAACCACTTCCATG TGGCTCTGGAGCGGGAGCCGGCCGAGCCGCACGGAAAGGAGGACGCCGTT CTCTTCACCTCGGGGTATTCCGCCAATGAGGGATCCCTGTCGGTTCTGGC CGGGGCCGTCGACGACTGCCAGGTCTTCTCGGATTCGGCGAACCACGCGT CCATCATCGACGGTTTACGGCACAGCGGCGCCCGCAAGCACGTATTCCGG CACAAGGACGGGCGGCATCTGGAGGAGTTGCTGGCCGCGGCCGACCGGGA CAAGCCGAAGTTCATCGCCCTGGAGTCCGTGCATTCGATGCGGGGCGACA TCGCGCTCCTGGCCGAGATCGCCGGCCTGGCCAAGCGGTACGGAGCGGTC ACCTTCCTCGACGAGGTGCACGCGGTCGGCATGTACGGCCCGGGCGGAGC GGGCATCGCGGCCCGGGACGGCGTGCACTGCGAGTTCACGGTGGTGATGG GGACCCTCGCCAAGGCCTTCGGCATGACCGGCGGCTACGTGGCGGGACCG GCCGTGCTCATGGACGCGGTGCGCGCCCGGGCCCGTTCCTTCGTCTTCAC CACGGCGCTGCCGCCGGCGGTCGCGGCGGGCGCGCTCGCCGCGGTGCGGC ACCTGCGCGGCTCGGACGAGGAGCGGCGGCGGCCGGCGGAGAACGCGCGG CTGACGCACGGCCTGCTCCGCGAGCGGGACATCCCCGTGCTGTCGGACCG GTCCCCCATCGTCCCGGTGCTGGTCGGCGAGGACCGGATGTGCAAGCGCA TGTCGGCCCTGCCGCTGGAGCGGCACGGCGCGTACGTCCAGGCCATCGAC GCGCCCAGCGTCCCGGCCGGCGAGGAGATCCTGCGGATCGCGCCCTCGGC GGTGCACGAGACCGAGGAGATCCACCGGTTCGTGGACGCCCTGGACGGCA TCTGGTCCGAACTGGGGGCCGCCCGGCGCGTCTGA (SEQ ID NO: 3)

TABLE 7 Amino Acid Sequence of moeA5 MDFFVRLARETGDRKREFLELGRKAGRFPAASTSNGEISIWCSNDYLGMG QHPDVLDAMKRSVDEYGGGSGGSRNTGGTNHFHVALEREPAEPHGKEDAV LFTSGYSANEGSLSVLAGAVDDCQVFSDSANHASIIDGLRHSGARKHVFR HKDGRHLEELLAAADRDKPKFIALESVHSMRGDIALLAEIAGLAKRYGAV TFLDEVHAVGMYGPGGAGIAARDGVHCEFTVVMGTLAKAFGMTGGYVAGP AVLMDAVRARARSFVFTTALPPAVAAGALAAVRHLRGSDEERRRPAENAR LTHGLLRERDIPVLSDRSPIVPVLVGEDRMCKRMSALPLERHGAYVQAID APSVPAGEEILRIAPSAVHETEEIHRFVDALDGIWSELGAARRV (SEQ ID NO: 26)

Two of the closest moe A5 homologues are found in streptomycetes-producers of the C₅N-containing antibiotics asukamycin and ECO-02301 (64% identity and 78% similarity) (Petricek 2006, McAlpine 2005). The sequence alignment is shown in Table 8. No moe A5-like genes were identified in the completely sequenced S. coelicolor and S. avermitilis genomes, suggesting that in Streptomyces, 5-ALA synthases control 5-ALA supply strictly for C₅N anabolism.

TABLE 8 Sequence Homology of moeA5 gi|37932054|gb|AAO62615.1| aminolevulinate synthase [Streptomyces nodosus subsp. asukaensis] Length = 409 Score = 460 bits (1183), Expect = 3e−127 Identities = 258/398 (64%), Positives = 313/398 (78%), Gaps = 6/398 (1%) Frame = +3 SEQ ID NO: 107                                SEQ ID NO: 108 Query 375 ISSSMDFFVRLARETGDRKREFLELGRKAGRFPAASTSNG------EISIWCSNDYLGMG 536 ++  +||| |   | | |+|||||+||+|||||+|    |      |||+|||||||||| Sbjct 1 MNKHLDFFAREMEEFGARRREFLEIGRRAGRFPSAVARQGQDGTDVEISVWCSNDYLGMG 60 SEQ ID NO: 109 Query 537 QHPDVLDAMKRSVDEYgggsggsrntggtnHFHVALEREPAEPHGKEDAVLFTSGYSANE 716 |+| ||+|+| +|| +| ||||||| |||||+|| || | |  ||||+|++| ||++||+ Sbjct 61 QNPFVLEAVKNAVDAFGAGSGGSRNIGGTNHYHVLLENELAALHGKEEALIFPSGFTAND 120 Query 717 GSLSVLAGAVDDCQVFSDSANHASIIDGLRHSGARKHVFRHKDGRHLEELLAAADRDKPK 896 |+|+||||      |||  |||||||||||||| | +||| |  |||||||||| ++|| Sbjct 121 GALTVLAGRAPGTLVFSDELNHASIIDGLRHSGAEKRIFRHNDMAHLEELLAAADPERPK 180 Query 897 FIALESVHSMRGDIALLAEIAGLAKRYGAVTFLDEVHAVGMYGPGGAGIAARDGVHCEFT 1076  | ||||+|| |||| ||| | ||+|+|| ||+||||||||||| |||||||+|+  ||| Sbjct 181 LIVLESVYSMSGDIAPLAETAALARRHGATTFIDEVHAVGMYGPQGAGIAAREGIADEFT 240 Query 1077 VVMGTLAKAFGMTGGYVAGPAVLMDAVRARARSFVFTTalppavaagalaavRHLRGSDE 1256 |||||||| ||  |||+|||| |+||||  +| |+ |+|||++||| ||||||||+||| |+ Sbjct 241 VVMGTLAKGFGTAGGYIAGPAALIDAVRNFSRGFIFTTSIPPATAAGALAAVQHLRASEG 300 Query 1257 ERRRPAENARLTHGLLRERDIPVLSDRSPIVPVLVGEDRMCKRMSALPLERHGAYVQAID 1436 || | | || | | ||+||||| +||+| || | ||+| +|++ ||| ||||| ||| |+ Sbjct 301 ERTRLAANAGLLHRLLKERDIPFVSDQSHIVSVFVGDDGLCRQASALLLERHGIYVQPIN 360 Query 1437 APSVPAGEEILRIAPSAVHETEEIHRFVDALDGIWSEL 1550 |||| |||||||+|||| | | ++ +| +|++||| +| Sbjct 361 APSVRAGEEILRVAPSATHTTGDVEKFAEAVEGIWRDL 398

Coexpression of different truncated variants of moe clusters 1 and 2 has revealed that moeA5 is nonfunctional and that the moe cluster 2 genes are sufficient to convert the precursor 19 into pholipomycin 21 (FIGS. 4A-4D). To probe whether unit A (FIG. 1) originates from 5-aminolevulinate produced by moeC4, as suggested, we fed 5-aminolevulinate to the moeB4⁺moeA4⁺ 38-1⁺ strain. Pholipomycin was not detected in cell extracts in our assay (data not shown). It was reported recently that similar supplementation of mutant asukamycin producers with aminolevulinate failed to yield unit A-tailored antibiotics (Petricek 2006). Perhaps 5-aminolevulinate is not the precursor for C5N units in secondary metabolites such as asukamycin and moenomycin. In any event, although the genes for unit A biogenesis have been identified, the biochemistry of synthesis and attachment remains obscure.

2. moeB5 (Cluster 1)

The moeB5 gene is located near the moe A5 gene. MoeB5 appears to have homology to the C-terminal portion of an acyl-CoA ligase gene (56% identity and 71% similarity to S. coelicolor homologue SCO6968). However, due to a large deletion of the central portion of the moeB5 gene (relative to a full-length acyl-CoA ligase gene) it is unlikely that moeB5 encodes a functional ligase, even though all the features of open reading frame are present. The nucleotide and polypeptide sequences are shown in Tables 9 and 10, respectively. A sequence alignment between moeB5 and the closest homolog identified in the BLAST search is shown in Table 11.

TABLE 9 DNA Sequence of moeB5. GTGGGCGGGCCCGGGGGCGACCCTCTCGGCGGCCACGATCCTCTCGGACT CCGCGGGCCGGTGGCCGGAGCGCACCGCGGTGGTCGCGGGCGCCGAGCGG ATCACCTCTGGGGCGTGGAGGTCGCGACAGCCCCGGTCCGAGGCGGAGGA CGCCGTCGGGCCGCTCCCTCCGGCGAGGTGGGCGAGATCGTCGTCCGCGG GCACAACCTGATGGCCGGGTACGTCGACGCCCCCCGCGCCACGGCCGCCG CGTTCGTGGACGGCTGGTTCCGCACCGGCGATCTAGGGCTGCTGGACGAG GAGGGGTACCCCACCGCCGTCGACCGCGAGAAGGACGTGATCCTGCGGGG CGGGTACGACGTCCATCCCCGTGAGGTCGAGGAAGCGCTGCTCCGCCATC CGGCGGTCGCCCGGGTCGCGGTGGTGGGGCTCCCCGACCCGGTGTACGGC CAGGAGGTGTGCGCGGTGGTGGTGCCACGGGACGGCCCGACACCGGAGGG GGCACTGGCGGATTCCGTCGTGGCGTGGGGTGAGCGGCACATCGCGGCGT ACCGGCGTCCGCGGCGGGTGGTCCTCCCCGACCGGCTTCCCCTGGGACCC GGCGGCAAGGTCCTCAAGGGGGAGCCGGCCGTCCGGCTCCGGTCGTCCGA CGAGGCGGGGGCGGCCCGGCCGAGGGGTGACGGCCCCGGCCGGTTCCCCG CCGGCGGGGGCGGCCCGGCCCGGACGAACGCCTCGGAGGCGGTGCGCGCC GCCCGGTCCGTGTCCCCGCCCGGGGCCGGTGTCCGGCGCGCTCAGTCGGT GAGCCCCAGCGTCCCGGCCGCCTGGATCGCCAGCCAGACCTCCGCCAGCG CGCCGGTCGAGGACAGGTCGCGCCGGGAGAGGGCGCCGAAGCGGCGCAGC CGGTAG (SEQ ID NO: 4)

TABLE 10 Amino Acid Sequence of moeB5 VGGPGGDPLGGHDPLGLRGPVAGAHRGGRGRRADHLWGVEVATAPVRGGG RRRAAPSGEVGEIVVRGHNLMAGYVDAPRATAAAFVDGWFRTGDLGLLDE EGYPTAVDREKDVILRGGYDVHPREVEEALLRHPAVARVAVVGLPDPVYG QEVCAVVVPRDGPTPDGALADSVVAWGERHIAAYRRPRRVVLPDRLPLGP GGKVLKGEPAVRLRSSDEAGAARPRGDGPGRFPAGGGGPARTNASEAVRA ARSVSPPGAGVRRAQSVSPSVPAAWIASQTSASAPVEDRSRRERAPKRRS R (SEQ ID NO: 27)

TABLE 11 Sequence Homology of moeB5 gb|AAX98210.1|acyl CoA ligase [Streptomyces aizunensis] Length = 506 Score = 197 bits (502), Expect = 3e−49 Identities = 98/175 (56%), Positives = 122/175 (69%), Gaps = 0/175 (0%) SEQ ID NO: 110 Query 38 GVEVATAPVRGGGRRRAAPSGEVGEIVVRGHNLMAGYVDAPRATAAAFVDGWFRTGDLGL 97 || || |     || |    |++||||| |||+||||+  |+ ||   |||||||||+|+ Sbjct 330 GVRVAIADAELEGRIRLLKQGDIGEIVVSGHNVMAGYLGRPQETAEVLVDGWFRTGDMGV 389 SEQ ID NO: 111 Query 98 LDEEGYPTAVDREKDVILRGGYDVHPREVEEALLRHPAVARVAVVGLPDPVYGQEVCAVV 157  ||+|| + |||+||+|+||||+|+|||||+ |||||||    |||+|   +|+||||| Sbjct 390 QDEDGYLSIVDRKKDMIVRGGYNVYPREVEDVLLRHPAVDGACVVGVPSVKHGEEVCAVV 449 Query 158 VPRDGPTPDGALADSVVAWGERHIAAYRRPRRVVLPDRLPLGPGGKVLKGEPAVR 212   + |    | ||+ +|||   |+|||+ ||||   +  |||  ||||| | | | Sbjct 450 RVKPGQRASGLLAEEIVAWSRVHMAAYKYPRRVEFVETFPLGSSGKVLKRELAHR 504

Like moe A5, moeB5 was shown to be nonfunctional in the course of heterologous expression of engineered moe cosmids. Thus, the likely lack of function of moeB5, along with the absence of dedicated amide synthase gene for C₅N unit transfer to pentasaccharide moiety in moe cluster 1 led to additional in silico searches. In these additional searches, a three-gene operon (named moe cluster 2) similar to that found in genomes of asukamycin and ECO-02301 producers (McAlpine 2005, Petricek 2006) was identified.

3. meeC4 (Cluster 2)

A second 5-ALA synthase encoding gene, moeC4, was identified in the moe cluster 2 (76.7% similarity between translation products of moe A5 and moeC4). The nucleotide and polypeptide sequences are shown in Tables 12 and 13, respectively. A sequence alignment between moeC4 and the closest homolog identified in the BLAST search is shown in Table 14. As described in Section III.A.1 above, moeC4 is involved in the production of 5-aminolevulinate.

TABLE 12 DNA Sequence of moeC4 GTGACGACCCAATATCTGGATCTCTTTGCACGCCTCACAGAAAACTCCGA CGGGGGAAAGAGGGAGTTCCTGGAGATCGGACGGCTCGCCGGGAGCTTCC CCGCGGCCAGCGTCCGCAGCAGTGGACCCGTGACCGGCCGGGACAGCATC AGCGTCTGGTGCAGCAACGACTACCTCGGCATGGGCCAGCATCCCGCAGT GCTCAAAGCCATGAAGGACGCGATCGACGAGTACGGCGCCGGCGCCGGCG GCTCACGCAACATCGGCGGCACCAACCACTACCACGTGCTGCTGGAGAGA GAGCTCGCCGCGCTCCACGGCAAGGACGAGGCCCTGCTGTTCACCTCCGG TTACACCGCCAACGACGGTGCGCTGTCCGTCATCGCCGGCCGCATGGAGA AGTGTGTCGTCTTCTCCGACGCACTCAACCACGCGTCCATCATCGACGGC CTGCGCCACAGCCGCGCCCAGAAGCAGATCTTCCGCCACAACGACCCCGC TCACCTGGAAGAACTGATAGCGGCGGCCGACCCCGACGTCCCCAAGCTCA TCGTCGCCGAGTCCGTGTACTCGATGAACGGCGAGATCGCCCCGCTGTCC GAAATCGCCGACATCGCCAAGCGCCACGGGGCGATGACGTAGCTCGACGA GGGAGGGCATCGCCGACGACTTCACCGTCATCATGGGCACCTTGGCCAAG GGTTTCGGCACCACCGGCGGCTACATCGCAGGGCCCGCCGAAATCATCGA GGCGGTGCGCATGTTCTCCCGCTCCTTCGTCTTCACCACCGCGCTGGCGC CGGCCGTGGCCGCCGGCGCCCTGGCAGCCGTACACCATCTGCGGTCCTCC GAGGTCGAGCGGGAACAGCTCTGGTCGAACGCGCAGTTGATGCACCGGCT GCTGAACGAGCGTGGCATCCCCTTCATTTCGGACCAGACGCACATCGTGT CCGTCATGGTGGGGGACGAGGCCGTGTGCAAGCGGATGTCCGCGCTGCTG CTCGACCGGCACGGAATCTACGTGCAGGCGATCAACGCGCCGAGCGTGCG GGTCGGTGAGGAGATCCTGCGGGTCGCCCCCGGAGCCGTGCACACCGCCG ACGAGGTACGCGAATTCGTCGAGGCTCTGAGCCAGGTCTGGGAGGAAGTG GGCTCCGCCCGCGTGCCGGCGACCCCGGCCGCTCTCTGA (SEQ ID NO: 5)

TABLE 13 Amino Acid Sequence of moeC4 VTTQYLDLFARLTENSDGGKREFLEIGRLAGSFPAASVRSSGPVTGRDSI SVWCSNDYLGMGQHPAVLKAMKDAIDEYGAGAGGSRNIGGTNHYHVLLER ELAALHGKDEALLFTSGYTANDGALSVIAGRMEKCVVFSDALNHASIIDG LRHSRAQKQIFRHNDPAHLEELIAAADPDVPKLIVAESVYSMNGDIAPLS EIADIAKRHGAMTYLDEVHAVGMYGPEGAGIAAREGIADDFTVIMGTLAK GFGTTGGYIAGPAEIIEAVRMFSRSFVFTTALAPAVAAGALAAVHHLRSS EVEREQLWSNAQLMHRLLNERGIPFISDQTHIVSVMVGDEAVCKRMSALL LDRHGIYVQAINAPSVRVGEEILRVAPGAVHTADDVREFVDALSQVWEEV GSARVPATPAAL (SEQ ID NO: 28)

TABLE 14 Sequence Homology of moeC4 |AAO62615.1|aminolevulinate synthase; Streptomyces nodosus subsp. asukaensis Length = 409 Score = 570 bits (1470), Expect = 3e−161 Identities = 283/401 (70%), Positives = 336/401 (83%), Gaps = 1/401 (0%) SEQ ID NO: 112 Query 4 QYLDLFARLTENSDGGKREFLEIGRLAGSFPAASVRSSGPVTGRDSISVWCSNDYLGMGQ 63 ++|| |||  |     +|||||||| || ||+|  |     |  + |||||||||||||| Sbjct 3 KHLDFFAREMEEFGARRREFLEIGRRAGRFPSAVARQGQDGTDVE-ISVWCSNDYLGMGQ 61 SEQ ID NO: 113                                SEQ ID NO: 239 Query 64 HPAVLKAMKDAIDEYGAGAGGSRNIGGTNHYHVLLERELAALHGKDEALLFTSGYTANDG 123 +| ||+|+|+|+| +|||+||||||||||||||||| ||||||||+|||+| ||+||||| Sbjct 62 NPFVLEAVKNAVDAFGAGSGGSRNIGGTNHYHVLLENELAALHGKEEALIFPSGFTANDG 121 Query 124 ALSVIAGRMEKCVVFSDALNHASIIDGLRHSRAQKQIFRHNDPAHLEELIAAADPDVPKL 183 ||+|+|||    +|||| ||||||||||||| |+|+|||||| ||||||+|||||+ ||| Sbjct 122 ALTVLAGRAPGTLVFSDELNHASIIDGLRHSGAEKRIFRHNDMAHLEELLAAADPERPKL 181 Query 184 IVAESVYSMNGDIAPLSEIADIAKRHGAMTYLDEVHAVGMYGPEGAGIAAREGIADDFTV 243 || ||||||+||||||+| | +|+|||| |++|||||||||||+||||||||||||+||| Sbjct 182 IVLESVYSMSGDIAPLAETAALARRHGATTFIDEVHAVGMYGPQGAGIAAREGIADEFTV 241 Query 244 IMGTLAKGFGTTGGYIAGPAEIIEAVRMFSRSFVFTTALAPAVAAGALAAVHHLRSSEVE 303 +|||||||||| |||||||| +|+||| ||| |+|||++ || |||||||| |||+|| | Sbjct 242 VMGTLAKGFGTAGGYIAGPAALIDAVRNFSRGFIFTTSIPPATAAGALAAVQHLRASEGE 301 Query 304 REQLWSNAQLMHRLLNERGIPFISDQTHIVSVMVGDEAVCKRMSALLLDRHGIYVQAINA 363 | +| +|| |+|||| || |||+|||+||||| |||+ +|++ |||||+||||||| ||| Sbjct 302 RTRLAANAGLLHRLLKERDIPFVSDQSHIVSVFVGDDGLCRQASALLLERHGIYVQPINA 361 Query 364 PSVRVGEEILRVAPGAVHTADDVREFVDALSQVWEEVGSAR 404 |||| ||||||||| | ||  || +| +|+  +| ++|  | Sbjct 362 PSVRAGEEILRVAPSATHTTGDVEKFAEAVEGIWRDLGIPR 402

Based on the studies of the present invention, it is propose that the polypeptide encoded by the moeC4 gene is an aminolevulinate synthase which participates in the conversion of Moe intermediate compound 18 or 19 in the course of moenomycin biosynthesis to yield a Moe compound MmA or 21, as shown in FIGS. 4A-4D.

4. moe A4 (Cluster 2)

Also identified in cluster 2 was the moe A4 gene, the translation product of which shows end-to-end homology to acy-CoA ligases (63% identity and 73% similarity to hypothetical acyl-CoA ligase from S. aizunensis). The nucleotide and polypeptide sequences are shown in Tables 15 and 16, respectively. A sequence alignment between moe A4 and the closest homolog identified in the BLAST search is shown in Table 17. The moe A4 protein may be involved in the formation of 5-ALA coenzyme A ester, a putative prerequisite for its intramolecular cyclization.

TABLE 15 DNA Sequence of moeA4 ATGACCCTGACCGCCGCGTCCGTACTGGCCGAGTCCGCCGGGCGACGCCC CGACCACCCCGCGCTCGTCTTCGGCTCCGAACGCATCACCTACGCCGAGC TCTGGCTCGCAACCCGCCGGTACGCGGCGGTGCTGAGGGACCGCGGTGTG CGCCCGGGCGACCGGATCGCCCTGCTGCTGCCGAACACACCGCACTTCCC GATGGTGTACTACGGCGTGCTGGCGCTCGGTGCCGTGGTGGTCCCGGTGC ACGGCCTGCTGCGTGCCGACGAGATCGTCCACGTGCTGGGCGAGTCCGAG GCGAAGGCCATGGTGTGCGCGGCCCCGATGCTGACCGAGGGCGCCAAGGC GGCCGGGACGGCCGGGGTTCCGCTGCTCACCGTCATGGTCGAGAACGGCG AGGACGACGACGGCCCGGCACGCCTCGACGTGCTCGCCGAACGGGCGGAG CCCCTGGACGGTCTGGTGCCGCGCGCGCCCGACGACTTGGCCTTGGTGCT GTACACCTCGGGCACCACCGGCCGGCCCAAGGGCGCGATGATCACCCACC TCAACCTGGTGATGAACGTCAGCACCACGATGCGCTCGCCGTTCGACCTC GGCCCCGAGGACGTGCTGCTGGGCTGTCTGCCGCTGTTCCACACCTTCGG CCAGACCTGCGGCATGAGCGCCTGTTTCCTGGCCGGCGGCACCCTGGTGC TCATGAACCGCTTCGACGGCCCCGGCGCGCTCGACCTCATGGTCACCGAG GGCTGCACGGTGTTCATGGGCGTCCCGACCATGTACCTGGCCCTCCTCGA CGCCGCCGCTCACGACGCCCGCCGCCCCGTGCTCGACCGCGCCTTCTCCG GCGGTTCGGCGCTACCGGTCAAGGTGCTCGAGGAGTTCCAGGAGGTCTAC GGCTGCCCGATCTACGAGGGGTACGGCCTCACGGAGACCTCGCCGGTGGT GGCGTACAACCAGAAGGCGTGGCCGCGCAGGCCCGGCACCGTGGGGCGCC CCATCTGGGGCGTGGAGGCGGAGATCGCCGCCGCCGACGTGGAGGACCGT ATCGAGCTGCTGCCGGCCGGGGAGATCGGGGAGATCGTCGTACGCGGCCA CAACGTCATGGCCGGCTACCTCAACCGGCCGGAAGCCACCGCAGCCGTGC TGGTCGACGGCTGGTTCCGCTCGGGCGACCTGGGGATGAAGGACGCCGAC GGCTATCTGACCATCGTCGACCGCAAGAAGGACATGGTGCTGCGCGGTGG CTACAACGTCTATCCACGCGAGGTGGAGGAGGTGCTGATGCGTCACCCGG CCGTCGCCCAGGTTGCCGTCATCGGTGTCCCCGACGACAAGTACGGCGAG GAGGTGTGCGCCGTGGTGCGGACGCGGCCGGGCACGGATCCGGACGCGGC GCTGGCCGCGCACATCGTGTCCTGGAGCAGGCAGCGAATCGCCGCGTACA AGTACCCGCGCCGGGTGGAGTTCGTCGAGGACTTCCCCCTCGGGCCGAGC GGCAAGGTACTGAAACGCGAACTCGCCGCCCGCTTCGCCGGCGGTGGCTG A (SEQ ID NO: 6)

TABLE 16 Amino Acid Sequence of moeA4 MTLTAASVLAESAGRRPDHPALVFGSERITYAELWLATRRYAAVLRDRGV RPGDRIALLLPNTPHFPMVYYGVLALGAVVVPVHGLLRADEIVHVLGDSE AKAMVCAAPMLTEGAKAAGTAGVPLLTVMVENGEDDDGPARLDVLAERAE PLDGLVPRAPDDLALVLYTSGTTGRPKGAMITHLNLVMNVSTTMRSPFDL GPEDVLLGCLPLFHTFGQTCGMSACFLAGGTLVLMNRFDGPGALDLMVTE GCTVFMGVPTMYLALLDAAAHDARRPVLDRAFSGGSALPVKVLEEFQEVY GCPIYEGYGLTETSPVVAYNQKAWPRRPGTVGRPIWGVEAEIAAADVEDR IELLPAGEIGEIVVRGHNVMAGYLNRPEATAAVLVDGWFRSGDLGMKDAD GYLTIVDRKKDMVLRGGYNVYPREVEEVLMRHPAVAQVAVIGVPDDKYGE EVCAVVRTRPGTDPDAALAAHIVSWSRQRIAAYKYPRRVEFVEDFPLGPS GKVLKRELAARFAGGG (SEQ ID NO: 29)

TABLE 17 Sequence Homology of moeA4 gb|AAX98210.1|acyl CoA ligase [Streptorayces aizunensis] Length = 506 Score = 624 bits (1610), Expect = 3e−177 Identities = 326/513 (63%), Positives = 379/513 (73%), Gaps = 7/513 (1%) SEQ ID NO: 114 Query 1 MTLTAASVLAESAGRRPDHPALVFGSERITYAELWLATRRYAAVLRDRGVRPGDRIALLL 60 || + |+|||||||| |   ||| |+|||+|| ||   ||||| || +|+ | |++|||+ Sbjct 1 MTRSVAAVLAESAGRWPSRTALVCGAERISYARLWDRARRYAAALRGQGIGPDDKVALLM 60 SEQ ID NO: 115 Query 61 PNTPHFPMVYYGVLALGAVVVPVHGLLRADEIVHVLGDSEAKAMVCAAPMLTEGAKAAGT 120 |||| |  ||+ |||||||||||| ||+  |+ |+| || |+|+| |  +  | |+ || Sbjct 61 PNTPEFAAVYFAVLALGAVVVPVHTLLKPAEVSHLLRDSGARALVWAGTLPQETARDAGE 120 Query 121 AGVPLLTVMVENGEDDDGPARLDVLAERAEPLDGLVPRAPDDLALVLYTSGTTGRPKGAM 180  || ||||    ||   |   ||   +  ||+|  | |  |||||||||||||||||||| Sbjct 121 TGVLLLTV----GEALHGSVLLD---DGVEPIDTYVERGADDLALVLYTSGTTGRPKGAM 173           SEQ ID NO: 116  SEQ ID NO: 117 Query 181 ITHLNLVMNVSTTMRSPFDLGPEDVLLGCLPLFHTFGQTCGMSACFLAGGTLVLMNRFDG 240 +|| |+  |++ |  |||  | +||||| ||| ||||| |||+  | || |||+| ||+ Sbjct 174 LTHGNVATNIAVTAVSPFAFGEDDVLLGALPLSHTFGQICGMAVTFHAGATLVVMERFEA 233 Query 241 PGALDLMVTEGCTVFMGVPTMYLALLDAAAHDARRPVLDRAFSGGSALPVKVLEEFQEVY 300   || ||   |||||||||||| |||+| |  |  | | | +|||||||| ||+  +  + Sbjct 234 HDALRLMREHGCTVFMGVPTMYHALLEAVAAGAPAPRLTRVYSGGSALPVPVLDRVRAAF 293 Query 301 GCPIYEGYGLTETSPVVAYNQKAWPRRPGTVGRPIWGVEAEIAAADVEDRIELLPAGEIG 360 || +||||||||||| |||||   | +||||| || ||   || |++| || ||  |+|| Sbjct 294 GCEVYEGYGLTETSPCVAYNQPGIPCKPGTVGLPIDGVRVAIADAELEGRIRLLKQGDIG 353 Query 361 EIVVRGHNVMAGYLNRPEATAAVLVDGWFRSGDLGMKDADGYLTIVDRKKDMVLRGGYNV 420 |||| ||||||||| ||+ || ||||||||+||+|++| ||||+||||||||++|||||| Sbjct 354 EIVVSGHNVMAGYLGRPQETAEVLVDGWFRTGDMGVQDEDGYLSIVDRKKDMIVRGGYNV 413 Query 421 YPREVEEVLMRHPAVAQVAVIGVPDDKYGEEVCAVVRTRPGTDPDAALAAHIVSWSRQRI 480 ||||||+||+|||||    |+|||  |+||||||||| +||      ||  ||+|||  + Sbjct 414 YPREVEDVLLRHPAVDGACVVGVPSVKHGEEVCAVVRVKPGQRASGLLAEEIVAWSRVHM 473 Query 481 AAYKYPRRVEFVEDFPLGPSGKVLKRELAARFA 513 ||||||||||||| |||| |||||||||| |+| Sbjct 474 AAYKYPRRVEFVETFPLGSSGKVLKRELAHRYA 506

To evaluate the function of moe A4 and to show that the genes in cluster 2 are used in moe biosynthesis, a moe A4 knockout strain of S. ghanaensis was generated. The mutant S. ghanaensis, termed LH1, did not produce moe A. Instead, it accumulated an antibacterially active moe intermediate. Testing via TLC, UV absorption and mass-spectrometry showed that the intermediate is identical to previously described moe A lacking the chromophore unit (Zehl 2006; see Materials & Methods).

The introduction of a functional copy of the moe A4 gene in trans into the LH1 mutant strain restored moe A production. Thus, the moe A4 knockout did not appear to alter the expression of other genes in the mutant. In sum, the data confirm that the moe A gene is used for C₅N unit formation during moes biosynthesis.

Based on the studies of the present invention, it is propose that the polypeptide encoded by the moeA4 gene is an acyl CoA ligase which participates in the conversion of Moe intermediate compound 18 or 19 in the course of moenomycin biosynthesis to yield a Moe compound MmA or 21, as shown in FIGS. 4A-4D.

5. moeB4 (Cluster 2)

The moeB4 gene has been identified as a putative amide synthase gene, and is homologous to different AMP-dependent synthetases and ligases, particularly to aminocoumarin ligase SimL (45% identity and 62% similarity), The nucleotide and polypeptide sequences are shown in Tables 18 and 19, respectively. A sequence alignment between moeB4 and the closest homolog identified in the BLAST search is shown in Table 20. The stop codon of the moeB4 gene overlaps the start codon of moe A4 by one nucleotide, and these two genes are transcribed divergently with respect to the moeC4 transcriptional direction. It is probable that moeB4 can transfer a CsN unit onto the saccharide scaffold of moe A, similarly to the amide synthases involved in aminocoumarin antibiotics biosynthesis.

TABLE 18 DNA Sequence of moeB4 ATGTCCTCGAACGAGAATTACGTCCGCCGGGTGCTTGAGGCGCTGGCCTC CGACCCCGACCGGATTGCCCTGTGGGCGGATGGTGAAGAAATCACCGCGG GCCAGGTCTCCAGGGCGGTTCTCACGGCAGCGGAACTTCTGCTCCGGCAC TTCACGGAACATCGAGACCCGAGTGCGGAAGGCAAGGCCCCGGTTGTGGC GGTGCTGACCGTCACCAACAGCCCGGCGACCATCATCCTCCGCTACGCGG CCAACCTGGCCGGGGCCACCCTGGTCCACCTGCACTCCACGAACGCGGTG GACCCCACCGACCAGCTGGCCGCCGCCGCCCGGCTGGACATTCTCAGCAA GACCGGGGCGACCTTCCTCGCCGTCGACAAGGAGAACCTCGACGCGGCCC GAGAGCTGTGCGACCGGCTGCCCGAGCCACCGCGTCTCGCCGCTCTCGGT GCCCTCGGCCCCGATGTCCTGGACCTCTCGTCGGGCGACCCGGACGCCTT CGGCCACGACGCCGTCGAGGCCGACCCCGAACAGCCGGCCGTGGTGATCT ACACCAGCGGTACCAGCGGACGTCCCAAGGGTGTCACGCAGCCGTACCGC CTTCGCCGTGCCAACCTCCAAGTGGCCCTCCAGTCCCCCGAACCCATCGT GTACCTGTCGACCCTGCCGGTGAGCAACTCCAGCGGCTCCGCCGTCGACG TCGCGCTCGCCTCCGGCGGAACGGTCGTCCTGCACGACGGGTTCGAGGCG GGCGAAGTGCTGCGGGCCGTGGAACAGCACCGCGTCTCCACGCTGACCAT CACCCCGCCGCAGCTGTACATGCTGATCGACCACCCCGACACCGCCACCA CCGACCGTTCGAGCATCAGGCTCATCACCTACCTCGGTTCCCCCGCGGCC CCCGCCCGACTGGCCGAGGCGGTCGAGGTGTTCGGCCCGGTGTTGCTCCA GCTCTACGGGACCACGGAAGTCAACGGCATCAGCATGCTGATGCCGCAGG ACCACTTCGACCCGGAACTGCGCCGGACCGTCGGACGTCCGACCACGGAG ATACGCATCCGCGACGTGGACGACGACCGCGACCTGCCGCCCGGCGAGAT CGGCGAGGTGTGCGTGCAGAGCCCGTCCACCATGCTCGGCTACTGGGGCG AACCGGAGCTGACCGCCGCGATCATCCGCGACGGCTGGGTGCACACCGGC GACCTCGGTTCCCTCGACGAGAACGGCTTTCTGCGCCTGCACGGCCGCAT GGGCGAGGTGATGAAGACCAACGGCATCAAGGTCCATCCCACCGATGTGG AGAACGCGCTGCTGACCCATCCGGAGGTCACCCAGGCCGCTGTGTACTGC GTGGTCGACGAGGACCGCGTGGAGCACATCCACGCCGCCGTCGTGGTACG GCCGGGCGGCACCGCCGACTCCGGGACGCTGATCGGCCACGTCGCCGCCG AGCTGTCTCCGAAGCACGTACCGGCCGTGGTGACGTTCCACGACGCGCTG CCCCTCACCCGTGCCGGAAAACCGGACAAGCCGGCGCTGGCCGCACGGCA CAACGGTGCGGCATGA (SEQ ID NO: 7)

TABLE 19 Amino Acid Sequence of moeB4 MSSNENYVRRVLEALASDPDRIALWADGEEITAGQVSRAVLTAAELLLRH FTEHRDPSAEGKAPVVAVLTVTNSPATIILRYAANLAGATLVHLHSTNAV DPTDQLAAAARLDILSKTGATFLAVDKENLDAARELCDRLPEPPRLAALG ALGPDVLDLSSGDPDAFGHDAVEADPEQPAVVIYTSGTSGRPKGVTQPYR LRRANLQVALQSPEPIVYLSTLPVSNSSGSAVDVALASGGTVVLHDGFEA GEVLRAVEQHRVSTLTITPPQLYMLIDHPDTATTDRSSIRLITYLGSPAA PARLAEAVEVFGPVLLQLYGTTEVNGISMLMPQDHFDPELRRTVGRPTTE IRIRDVDDDRDLPPGEIGEVCVQSPSTMLGYWGEPELTAAIIRDGWVHTG DLGSLDENGFLRLHGRMGEVMKTNGIKVHPTDVENALLTHPEVTQAAVYC VVDEDRVEHIHAAVVVRPGGTADSGTLIGHVAAELSPKHVPAVVTFHDAL PLTRAGKPDKPALAARHNGAA (SEQ ID NO: 30)

TABLE 20 Sequence Homology of moeB4 gb|AAK06803.1|putative aminocoumarin ligase SimD5 [Streptomyces antibioticus] Length = 519 Score = 424 bits (1089), Expect = 7e−117 Identities = 236/517 (45%), Positives = 322/517 (62%), Gaps = 12/517 (2%) SEQ ID NO: 118 Query 1 MSSNENYVRRVLEALASDPDRIALWADGEEITAGQVSRAVLTAAELLLRHFTEHRDPSAE 60 |  ||+|||++|  | +||  +||      + || ++ ++ +||| +          | Sbjct 1 MEGNEHYVRQILNTLRADPSGVALVHRDTPVIAGDLADSITSAAEAMRG--------SGV 52 SEQ ID NO: 119 Query 61 GKAPVVAVLTVTNSPATIILRYAANLAGATLVHLHSTNAVDPTDQLAAAARLDILSKTGA 120 |   || +||  |+|||++ |||||| |||+|||   || +|+| |+| |+  |+++ Sbjct 53 GVGSVVGILTDPNTPATLVARYAANLLGATVVHLFGVNAANPSDLLSAEAQGGIVAEALP 112 SEQ ID NO: 120 Query 121 TFLAVDKENLDAARELCDRLPEPPRLAALGALGPDVLDLSSGDPDAFGHDAVEADPEQPA 180   + ||  ||+ || + +     | |+ || || ||+||+     ||  ||  |     | Sbjct 113 AMVVVDAANLERARAIREVPSVRPVLSGLGELGHDVIDLTDSPAGAFRPDA--ARDGDTA 170 Query 181 VVIYTSGTSGRPKGVTQPYRLRRANLQVALQSPEPIVYLSTLPVSNSSGSAVDVALASGG 240 || ++||++|||||    +|++   +  + +  +    | | |+++|+|   |  | ||| Sbjct 171 VVTFSSGSTGRPKGTAWSFRVKADMVAASARRAQKATALVTAPLTHSNGFVADDVLVSGG 230 SEQ ID NO: 121 Query 241 TVVLHDGFEAGEVLRAVEQHRVSTLTITPPQLYMLIDHPDTATTDRSSIRLITYLGSPAA 300 ||||  ||+  ||||+| +++|+ | ++ |||| | |||+|  || ||+| + | |  |+ Sbjct 231 TVVLLPGFDETEVLRSVARYQVNRLAVSAPQLYALADHPETTRTDLSSVRDLFYTGVAAS 290 Query 301 PARLAEAVEVFGPVLLQLYGTTEVNGISMLMPQDHFDPELRRTVGRPTTEIR--IRDVDD 358 | |+| | +||| ||+|+|||+| | || |+  +| |  || |||||   +|  |||  | Sbjct 291 PERVAVAEKVFGSVLMQVYGTSETNIISWLIAGEHTDAGLRATVGRPLEWLRVTIRDPQD 350 SEQ ID NO: 122 Query 359 DRDLPPGEIGEVCVQSPSTMLGYWGEPELTAAIIRDGWVHTGDLGSLDENGFLRLHGRMG 418 +| || || ||| | ||  |  || +|| ||  +||||+ |||+| ||+ |+| ||||+ Sbjct 351 ERVLPTGETGEVWVNSPWRMDHYWNDPEQTARTVRDGWIRTGDVGHLDDAGYLHLHGRLA 410 Query 419 EVMKTNGIKVHPTDVENALLTHPEVTQAAVYCVVDEDRVEHIHAAVVVRPGGTADSGTLI 478  |+|||||||+|  || +|| ||+| +|||+ | + |||| ||| ||+| |  |    | Sbjct 411 GVIKTNGIKVYPVAVERSLLDHPDVAEAAVFGVENSDRVERIHAVVVLREGAGAGPEDLR 470 Query 479 GHVAAELSPKHVPAVVTFHDALPLTRAGKPDKPALAA 515  ||+ ||| | || +    +|||   |||||  | | Sbjct 471 QHVSSHLSPNHAPADIELRSSLPLIGFGKPDKLRLRA 507

Based on the studies of the present invention, it is propose that the polypeptide encoded by the moeB4 gene is an amide synthetase which participates in the conversion of Moe intermediate compound 18 or 19 in the course of moenomycin biosynthesis to yield a Moe compound MmA or 21, as shown in FIGS. 4A-4D.

B. Glycosyltransferase Genes

Five putative glycosyltransferase (GT) genes were identified within moe cluster 1, moeGT1 through moeGT5. Five moe GT genes were proposed to govern the assembly of moe A pentasaccharide moiety, but the functions of these genes was not established. Based on sequence analysis, we have suggested that moeGT1 controls the attachment of the second sugar (unit E) during moe A production. However, we were unable to isolate any monosaccharide intermediates from the producing organism, S. ghanaensis, when moeGT1 was disrupted. However, we have found that a recombinant ΔmoeGT4 S. lividans TK24 strain accumulates two new compounds having LC characteristics and exact masses consistent with monosaccharide intermediates 2 and 3 (Table 21, FIGS. 4A-4D), indicating that moeGT4 controls the attachment of the E ring. Notably, 2 and 3 contain C15 chains and we did not detect any monosaccharides having C25 chains. Strain ΔmoeGT was found to produce a trisaccharide, 11, which contained the complete C25 chain; the double mutant strain, ΔmoeGT5ΔmoeGT3, accumulated a disaccharide moe A intermediate 4 having a C15 chain (Table 21, FIGS. 4A-4D).

TABLE 21 MoeA pathway products found in S. lividans TK24 strains carrying subsets of moe genes Mutation(s) moeno38-1/ Mass ((M-H)⁻) eoexpression type Mt. Min¹ Caled Obsvd ΔmoeN5 23 4.2 1364.5026 1364.5023 22 4.1 11365.4867 1365.4884 ΔmoeGT4 2 4.7 564.2215 564.2210 3 4.8 578.2372 578.2374 ΔmoeF5 1 3.9 565.2056 565.2054 ΔmoeGT5GT3 4 4.8 781.3166 781.3143 ΔmoeGT5 11 10.4 1122.5004 1122.5004 ΔmoeGT2 15 10 1325.5797 1325.5789 ΔmoeH5 17 9.3 1501.6118 1501.6115 ΔmoeK5 24² 9.6 1486.6122 1486.6116 ΔmoeB5A5 19 9.9 1500.6278 1500.6273 5-1⁺ ΔmoeB5A5 21 9.3 1596.6490 1596.6492 5-1⁺ ΔmoeH5 17 9.7 1501.6118 1501.6122 moeR5⁺ ΔmoeB5A5 18 10.0 1484.6329 1484.6326 moeR5⁺ ΔmoeH5 16 9.4 1485.6169 1485.6195 ¹250 × 4.6 mm Agilent C₁₈ column, under LC conditions used for accurate mass determination (Ostash 2007) ²proposed structure of this compound is shown in SI

The isolation of the farnesylated mono- and disaccharides 2, 3, and 4 and the moenocinylated trisaccharide 11 strongly suggests that MoeN5 converts C15-linked precursors into C25-linked intermediates after at least three glycosylation steps. Since moenomycins without the branching glucose unit D are naturally produced by S. ghanaensis (Welzel 2005), and accumulate when moeGT3 is disrupted in this strain, we propose that moe A biosynthesis can follow two branches from precursor 4, depicted on FIG. 4, which merge at the stage of tetrasaccharide 14/15. In one branch, MoeGT3 attaches the D ring glucose; in the other, MoeGT5 attaches the C ring, which can be either GlcNAc or chinovosamine (see below). Trisaccharides 8 and 9/10 from both branches of the biosynthetic pathway must be acceptor substrates for MoeN5-catalyzed lipid chain elongation. Strain ΔmoeGT2 accumulates the tetrasaccharide moe A precursor 15 (Table 21, FIGS. 4A-4D), showing that moeGT2 controls the attachment of the B ring sugar. Thus, the gene disruption studies have allowed us to propose functions for all of the glycosyltransferases except MoeGT1 based on the identification of moe A intermediates. By a process of elimination, we propose that moeGT1 controls the first glycosylation to attach the F ring precursor to the farnesylated phosphoglycerate, which is consistent with our inability to detect any glycosylated moe A intermediates in the moeGT1-deficient S. ghanaensis mutant OB21e (Ostash 2007).

Each of the moe glycosyltransferase (GT) genes will be discussed in further detail below.

1. moeGT1

The closest homologues of the moeGT1 gene product are MurG-like UDP-N-acetylglucosamine: LPS-acetylglucosamine transferases from various bacteria (27% identity and 40% similarity to putative GT from Polaromonas sp JS666). Conserved domain database (CDD) search revealed presence of GT group 1 domain (pfam 00534) in C-terminal portion of moeGT1, as well as incomplete MurG and RfaG domains (COG0707 and COG0438, respectively). The nucleotide and polypeptide sequences are shown in Tables 22 and 23, respectively. A sequence alignment between moeGT1 and the closest homolog identified in the BLAST search is shown in Table 24. GTs having these domains have been shown to be involved in the synthesis of lipopolysaccharides. For example, in peptidoglycan biosynthesis, MurG transfers N-acetylglucosamine onto monoglycosylated carrier Lipid I, thus forming Lipid II (Men 1998, Heijenoort 2001).

TABLE 22 DNA Sequence of moeGT1. ATGGCTGCCCCCGACCGACCGCTCGTCCAGGTGCTCTCCCCCCGGACCTG GGGCGAGTTCGGCAACTACCTCGCCGCGACGCGCTTCTCCCGCGCGCTCC GGAGCGTGATCGAGGCGGAAGTGACCCTGCTGGAGGCGGAGCCGATCCTC CCGTGGATCGGCGAGGCCGGGGCGCAGATCCGGACCATCTCCCTGGAGAG CCCCGACGCCGTCGTCCGCAACCAGCGGTACATGGCCCTCATGGACCGCC TCCAGGCACGCTTCCCGGAGGGGTTCGAGGCGGACCCCACCGCCGCCCAG CGGGCGGACCTGGAACCGCTCACCCGGCACCTGCGGGAGAGCGCCCCCGA CGTGGTGGTCGGCACGAAGGGGTTCGTGGCGAGGCTGTGCGTGGCCGCCG TCCGGCTCGCCGGGACGTCCACCAGGGTCGTCAGCCACGTGACCAACCCC GGGCTGCTGCAGCTGCCGCTGCACCGCAGCCGGTACCCGGACCTGACACT CGTCGGCTTCCCCCGGGCGAAGGAGCACCTGCTGGCCACGGCCGGCGGCG ACCCGGAGCGCGTCCAGGTGGTGGGCCCGCTCGTCGCCCAGCACGACCTG CGGGACTTCATGACCAGTGAGACGGCCGTCTCCGAGGCGGGGCCCTGGGG CGGCGACTCGGGCCCGGACCGGCCACGGGTGATCATCTTCTCCAACCGCG GCGGGGACACCTACCCCGAGCTGGTGCGGCGCCTCGCCGACCGCCACCCC GGCATCGACCTCGTCTTCGTCGGCTACGGCGACCCGGAGCTCGCCCGCCG CACCGCTGCGGTCGGGCGGCCCCACTGGCGGTTCCACAGCGTCCTCGGCC AGAGCGAGTACTTCGACTACATCCGGCGTGCCTCCCGGTCCAGGTACGGG CTCCTCGTCTCGAAGGCGGGGCCCAACACCACCCTGGAGGCGGCCTACTT CGGCATACCGGTCCTGATGCTCGAGTCGGGGCTGCCCATGGAGCGGTGGG TGCCGGGACTGATCCACGAGGAGGGGCTGGGCCACGCCTGCGCCACCCCC GAGGAGCTGTTCCGCACGGCGGACGACTGGCTGACCCGCCCGTCGGTGAT CGAGGTGCACAAGAAGGCCGCGGTCTCCTTCGCCGCTTCCGTACTGGACC AGGACGCGGTGACGGCCAGGATCAAGGCCGCCCTCCAGCCCCTGCTGGAC GCCCGATGA (SEQ ID NO: 8)

TABLE 23 Amino Acid Sequence of moeGTI MAAPDRPLVQVLSPRTWGEFGNYLAATRFSRALRSVIDAEVTLLEAEPIL PWIGEAGAQIRTISLESPDAVVRNQRYMALMDRLQARFPEGFEADPTAAQ RADLEPLTRHLRESAPDVVVGTKGFVARLCVAAVRLAGTSTRVVSHVTNP GLLQLPLHRSRYPDLTLVGFPRAKEHLLATAGGDPERVQVVGPLVAQHDL RDFMTSETAVSEAGPWGGDSGPDRPRVIIFSNRGGDTYPELVRRLADRHP GIDLVFVGYGDPELARRTAAVGRPHWRFHSVLGQSEYFDYIRRASRSRYG LLVSKAGPNTTLEAAYFGIPVLMLESGLPMERWVPGLIHEEGLGHACATP EELFRTADDWLTRPSVIEVHKKAAVSFAASVLDQDAVTARIKAALQPLLD AR (SEQ ID NO: 31)

TABLE 24 Sequence Homology of moeGT1 gi|84696063|gb|EAQ21850.1|similar to UDP-N- acetylglucosaraine:LPS N-acetylglucosamine transferase [Polaromonas naphthalenivorans CJ2] Length = 599 Score = 36.6 bits (83), Expect = 2.3, Method: Composition-based stats. Identities = 68/246 (27%), Positives = 100/246 (40%), Gaps = 17/246 (6%) SEQ ID NO: 123 Query 160 SRYPDLTLVGFPRAKEHLLATAGGDPERVQVVGPLVAQHDLRDFMTSETAVSEAGPWGGD 219 |+  | | +     +   || ||  |++|   | +  +    |  | |||++  | Sbjct 144 SKRIDRTFLAHTDLESRWLA-AGVPPDKVTTSG-MPVRAPAADGATRETALTALG----- 196 SEQ ID NO: 124       SEQ ID NO: 240     SEQ ID NO: 241                  SEQ ID NOS: 125  126    127 Query 220 SGPDRPRVIIFSNRGG-DTYPELVRRLADRHPG-IDLVFV-GYGDPELARRTAAVGR-PH 275   || | |+| | + |   |  +|  ||  ||| + ++ | |    + |  ||   | | Sbjct 197 LAPDAPTVLITSGKEGVGDYALVVESLARHHPGPLQIIAVCGANARQQALLTALQKRLPE 256 SEQ ID NO: 128 SEQ ID NO: 129 Query 276 WRFHSVLGQSEYFDYIRRASRSRYGLLVSKAGPNTTLEAAYFGIPVLMLESGLPMERWVP 335      | |   + | +  |      ||++|||  |  ||   | | ++|+     || Sbjct 257 PVALKVCGLVPHADLL--AWMRAADLLITKAGGMTPAEAFAVGTPTILLDVVSGHERENA 314                   SEQ ID NO: 130 Query 336 GLIHEEGLGHACATPEELFRTADDWLTRPSVIEVHKKAAVSFAASVLDQDAVTARIKAAL 395  |    |+     |  +    |   |  |      ++| ++|     |+  +    + || Sbjct 315 ALFVRLGVADLADTLAQAGELAAAVLASPQRQTAMRRAQLAFH----DRAGLGRIARFAL 370                                                SEQ ID NO: 131 Query 396 QPLLDA 401  | | | Sbjct 371 DPALPA 376

To evaluate the function of moeGT1 with respect to moe biosynthesis, a recombinant S. ghanaensis strain, termed OB21e, was generated (see Materials and Methods, see FIGS. 10A-10B). OB21e included an insertionally inactivated moeGT1 gene. The moeGT1 deficient OB21e mutant did not produce moe A or any of its antibiotically active precursors as determined by bioassays and LC-MS analysis (see FIGS. 11A-11D). To exclude the possibility of polar effects of the moeGT1 knockout on downstream genes expression, we introduced a functional copy of the moeGT1 gene under the control of the ermE promoter (plasmid pOOB41c) into OB21e strain. This complemented the moe A nonproducing phenotype of the OB21e mutant, yielding a full-size, functional moe A product. The trisaccharide degradation product of moe A (units C-E-F-G-H, FIG. 1) is known to display, in vivo, the full antibacterial activity of parent compound (Welzel 1987). Based on the studies of the present invention, it is propose that the polypeptide encoded by moeGT1 gene is a glycosyl transferase that attaches the first sugar (e.g., glucuronic acid; GalA) to the phosphoglycerate-farnesyl moiety of Moe intermediate compound 1P in the course of moenomycin biosynthesis to yield a Moe intermediate compound 1, as shown in FIGS. 4A-4D.

2. moeGT2, moe GT3, moeGT4 and moe GT5

The putative translation product of the moeGT2 gene shows homology to known GTs involved in lipopolysaccharide O-antigen biosynthesis in Yersinia enterocolitica, Escherichia coli and Streptococcus agalactiae (28% identity and 47% similarity) (Zhang 1997, Paton 1999, Chaffin 2002). The nucleotide and polypeptide sequences are shown in Tables 25 and 26, respectively. A sequence alignment between moeGT2 and the closest homolog identified in the BLAST search is shown in Table 27. MoeGT2 also contains a conserved GT domain (pfam00535) present in very diverse family-2 GTs, which transfer sugars to a range of substrates including cellulose, dolichol phosphate and teichoic acids.

TABLE 25 DNA Sequence of moeGT2 CTGACACACGAGGTCACCCCGAGGGGCGGCCCGGAAGGAGACGCGATGGT GACAGCGGGGCCGGCCGGGGCGGCGGTGACCGTCGTCCTGCCTCACTACG ACTGCGCGGCGTACCTGGGTGCGGCCGTCGGATCGGTGCTCTCCCAGGAC CGCCCGGACCTGCGCCTGACGGTGGTGGACGAATGCTCGCCCGAAGAGAA GTGGGCCCGCGCACTCCACCCGTACGCCGGCGACCCCCGGCTGACCGTGG TCCGCACCTCCCGCAACGTCGGCCACCTGCGGATCAAGAACAAGGTCCTG GAATCGGTGGACACCCCCTACGTGGCCTTCCAGGACGCCGACGACATCAG CCTGCCGGGCCGGCTGCGCCACCAGCTGGCCCTCCTGGAGAGCGGCGGCG CCGATCTGGTCGGCTGCGCCTACTCCTACATCGACGAGGCGGGCCGTACG ACGGGACACCGGCGGATGCCCCGCAACGGCAACCTCTGGATGCGGCTGGG GCGGACGACCGTGCTCCTGCACCCGTCCTCGGTGGTGCGGCGCTCGGTGC TCGAGAGGCTCGGCGGCTTCGACGGCACCGCGCGCCTGGGGGCCGACACC GACTTCCACCTGCGGGCCGCCCGCCTGTACCGGCTGCGCAGTGTGCGCAA GGTGCTCTACCGGTACCGGATCTGGCCCAAGTCGCTCACCCAGGCGCCGG ACACCGGGTTCGGGTCCGCGGAGCGCCGGGCCTACACCGAGGCGATGACC GCGCAGGAGGAGCGGCGGCGACGGGCGCGGACCCGTGAGGAGCTGCTGCC GCTGCTGGTCGCCCCGCCCAACGACGTCGACTTCACCCTGACCCGGGTCG ACCTCGACTAG (SEQ ID NO: 9)

TABLE 26 Amino Acid Sequence of moeGT2 LTHEVTPRGGPEGDAMVTAGPAGAAVTVVLPHYDCAAYLGAAVGSVLSQD RPDLRLTVVDECSPEEKWARALHPYAGDPRLTVVRTSRNVGHLRIKNKVL ESVDTPYVAFQDADDISLPGRLRHQLALLESGGADLVGCAYSYIDEAGRT TGHRRMPRNGNLWMRLGRTTVLLHPSSVVRRSVLERLGGFDGTARLGADT DFHLRAARLYRLRSVRKVLYRYRIWPKSLTQAPDTGFGSAERRAYTEAMT AQEERRRRARTREELLPLLVAPPNDVDFTLTRV DLD (SEQ ID NO: 32)

TABLE 27 Sequence Homology of moeGT2 gb|AAU93096.1|glycosyl transferase, group 2 family protein [Methylococcus capsulatus str. Bath] Length = 367 Score = 104 bits (260), Expect = 4e−21 Identities = 80/228 (35%), Positives = 117/228 (51%), Gaps = 7/228 (3%) SEQ ID NO: 132       SEQ ID NO: 133 Query 7 PRGGPEGDAMVTAGPAGAA--VTVVLPHYDCAAYLGAAVGSVLSQDRPDLRLTVVDECSP 64 |   |  |    + | | |  ||+++| |+   || ||+ |+| |   |  | ++|+ | Sbjct 11 PHHVPHRDMTHHSRPPGHAPRVTILMPVYNGEKYLAAAMESILDQTFRDFILLIIDDGSS 70 SEQ ID NO: 242 Query 65 EEKWARALHPYAGDPRLTVVRTSRNVGHLRIKNKVLESVDTPYVAFQDADDISLPGRLRH 124 +   | |     ||||+ | |  +|+| ++  |+ |+ | | +||  | |||+|| || Sbjct 71 DSSLAIARS--FGDPRVQVERNPKNLGLVKTLNRGLDLVQTEFVARMDCDDIALPDRLEK 128            SEQ ID NO: 134       SEQ ID NO: 135 Query 125 QLALL-ESGGADLVGCAYSYIDEAGRTTGHRRMPRNGNLWMRLGRTTVLLHPSSVVRRSV 183 |+| | |+    + | ||    |+ | |  |   |+  +   |    | || | +||  | Sbjct 129 QIAFLDENPDIGMCGTAYELFHESLRQT-IRPPCRHEEIVYGLLDDNVFLHSSVIVRMEV 187                              SEQ ID NO: 136       SEQ ID NO: 137 Query 184 LERLG-GFDGTARLGADTDFHLRAARLYRLRSVRKVLYRYRIWPKSLT 230 | | |  +    ||  | +   | ||   + ++ +|| |||  |++++ Sbjct 188 LNRHGLRYREDYRLAEDYELWARLARYTHIGNLPQVLVRYRSHPENVS 235

Based on the studies of the present invention, it is propose that the polypeptide encoded by the moeGT2 gene is a glycosyltransferase that attaches a sugar moiety (e.g., glucuronic acid (GalA)) to the Moe intermediate compound 14 or 15 in the course of moenomycin biosynthesis to yield a Moe intermediate compound 16 or 17, respectively, as shown in FIGS. 4A-4D.

The product of the moeGT3 gene is similar to family-2 GTs involved in antibiotic production (32% identity and 45% similarity to AprG1 GT from S. tenebrarius apramycin gene biosynthetic cluster) (Du 2004), and biofilm and antigen biosynthesis (23% identity and 44% similarity) (Kaplan 2004, Wang 2004). It also has homology to putative GTs involved in cell wall biogenesis. The moeGT3 gene includes a putative conserved GT domain (COG1216) which is present in many GTs. The nucleotide and polypeptide sequences are shown in Tables 28 and 29, respectively. A sequence alignment between moeGT3 and the closest homolog identified in the BLAST search is shown in Table 30.

TABLE 28 DNA Sequence of moeGT3 GTGGCCGTCCTCCGCGGTGACGACGAGGCGCTCCCCCACTGGCTGTGGCA CCTGGCGCGGGCGGTCTGGTACGGCGGCGGGGACGGCACCGGGCCGGTCG GCCTGGTGCAGTGCGGCGCCCTGCGGCTGAGGGACGACGGCCTGGTGGAC GGGTTCGCCCTGCCGCCCGCGTCCCCGCGGACCCGGCCCTCCCCCTCGGA CCTCCTCGAGGGCGCCTACGCGGTGCGGCGCGAACTGCTGGACGCGGACG GCGGTACGGCGCCCTGGGTCGCCCTGCCCATGCCGCTGGTCCGCCGCCGG TCCGGCGGCGCCGGGGACCCGGCCGCGGTCCTGGCCCCCGGGACGCGCGT CGCGCGACGCACCCGCCTGGTCCGGCACGGGTACCGGCCGCCCGCCGCGA GGCCGCGGAACGGGAGCACTCCCCGGCTGGTGTCGGTGGTCGTCCCGGTG CGCAACGGCGCCCGCACGCTCGCCGCCCAGCTGACCGCCCTGGCCCGGCA GACCGGAGCCGTCGCCTACGAGGTGCTGGTCGTCGACAACGGCTCGACGG ACACCACCCGCGAGGTCGCCGAACGGGCCCGCGCCGAGCTGCCGGACCTG CGGATCGTGGACGCGTCCGACCGTGCCGGTGAGAGCTGTGCCCGCAACCG GGGAATCGCCGCGGCGCGCGGCGACTTCGTCGCGTTCTGCGACGCGGACG ACGTCGCCGACACCGGCTGGCTGGCCGCGATGGCCCAGGCGGCCAAGGAG GCCGATCTGGTGGGAGGCGGACTGGAGACCTCCGTGCTCAGTCCCGGCCG CGTCGACGAGCAGCCCCTGCCGATGGACGCCCAGACCGATTTCCTGCCGT TCGCCCGGGGGGCGAACTGCGGTGCCTGGAAGGACGTCCTGACCGCGCTG GGCGGCTGGGACGAGCGCTACCGGGGCGGCGGGGAGGACATGGACCTCTC CTGGCGCGCCCAGCTCTGCGGTTACCTCGTCCGCTACGCGGACGACGCCC GGATGCACTACCGGTTGCGGGACGGACTGCCGGCGCTGGCACGGCAGAAG TGGAACTACGGCCGTTCCGGGGCCCAGTTGTACGCCGCGTACCGGCGCGC CGGGTTCGAACGGCGCGACGGCCGGGTGGTCGTCAGGAACTGGTGCTGGC TGCTGCTGCACGTTCCGAACCTGGTCCGGTCCACCGGACCCTGCGGCCAC GCTGAGTCCGCTACGCGCCCGGCTGGCCGGTTTCCTGGTTTG TGA (SEQ ID NO: 10)

TABLE 29 Amino Acid Sequence of moeGT3 VAVLRGDDEALPHWLWHLARAVWYGGGDGTGPVGLVQCGALRLRDDGLVD GFALPPASPRTRPSPSDLLEGAYAVRRELLDADGGTAPWVALPMPLVRRR SGGAGDPAAVLAPGTRVARRTRLVRHGYRPPAARPRNGSTPRLVSVVVPV RNGARTLAAQLTALARQTGAVAYEVLVVDNGSTDTTREVAERARAELPDL RIVDASDRAGESCARNRGIAAARGDFVAFCDADDVADTGWLAAMAQAAKE ADLVGGGLETSVLSPGRVDEQPLPMDAQTDFLPFARGANCGAWKDVLTAL GGWDERYRGGGEDMDLSWRAQLCGYLVRYADDARMHYRLRDGLPALARQK WNYGRSGAQLYAAYRRAGFERRDGRVVVRNWCWLLLHVPNLVRSTGPCGH AESATRPAGRFPGL (SEQ ID NO: 33)

TABLE 30 Sequence Homology of moeGT3 ref|ZP_10616987.1| Glycosyl transferase, family 2 [Kineococcus radiotolerans SRS30216] Length = 289 Score = 197 bits (500), Expect = 1e−48 Identities = 122/277 (44%), Positives = 156/277 (56%), Gaps = 8/277 (2%) SEQ ID NO: 138 Query 144 VSVVVPVRNGARTLAAQLTALARQTGAVAYEVLVVDNGSTDTTREVAERARAELPD-LRI 202 ||||+|  |  | | ||| ||| |+    +||+| ||||||   |  |     +|  || Sbjct 5 VSVVIPCFNATRDLPAQLEALAGQSTVCTFEVVVSDNGSTDGLAEFVEEWSRRVPFMLRR 64 SEQ ID NO: 139  SEQ ID NO: 243 Query 203 VDASDRAGESCARNRGIAAARGDFVAFCDADDVADTGWLAAMAQAAKEADLVGGGL---- 258 |||| | | + ||| |  ||  | +  ||||||   ||+ |||+| ++|||||| | Sbjct 65 VDASARRGVAHARNAGCRAALADVILVCDADDVVGVGWVDAMARALEQADLVGGTLVHGH 124  SEQ ID NO: 140 Query 259 -ETSVLSPGRVDEQPLPMDAQTDFLPFARGANCGAWKDVLTALGGWDERYRGGGEDMDLS 317   |+++   |    |  +  +   ||+| ||| |  ++|  ||||||| +  ||+|++ | Sbjct 125 LNTALVQQWRPTSPPGVLPTKLSHLPYAVGANVGLRREVFDALGGWDEGFVAGGDDVEFS 184 Query 318 WRAQLCGYLVRYADDARMHYRLRDGLPALARQKWNYGRSGAQLYAAYRRAGFERRDGRVV 377 ||||  |+ +| | || + ||+|  | |  +| + | || | |   +| ||  ||  | + Sbjct 185 WRAQHAGFCLRSAPDAVIAYRMRTTLSANVKQSYFYARSDALLMRTFRSAGVPRRGLRPL 244                         SEQ ID NO: 141 Query 378 VRNWCWLLLHVPNLVRSTGPCGH-AESATRPAGRFPG 413 +    ||+ +||   |  |  |     |   |||| | Sbjct 245 ITESKWLVRNVPR-TREPGFRGQWLRRAAMLAGRFVG 280               SEQ ID NO: 142

Based on the studies of the present invention, it is propose that the polypeptide encoded by the moeGT3 gene attaches is a glycosyltransferase that attaches a sugar moiety (e.g., glucose (Glc)) to the Moe intermediate compound 4, 12 or 13 in the course of moenomycin biosynthesis to yield a Moe intermediate compound 5, 14 or 15, respectively, as shown in FIGS. 4A-4D.

Gene moeGT4 encodes putative 427 amino acid protein which N-terminal portion shows moderate homology (27% identity and 38% similarity) to putative family 2 GT from Mycobacterium vahbaalenii PYR-1. The nucleotide and polypeptide sequences are shown in Tables 31 and 32, respectively. A sequence alignment between moeGT4 and the closest homolog identified in the BLAST search is shown in Table 33. CDD search failed to locate conserved domain(s) within moeGT4. However, more careful inspection of moeGT4 sequence using the HHpred program (at ExPaSy proteomics server) showed that the C-terminus of moeGT4 exhibits a low degree of homology to chitin synthase and GT 2 conserved domains (accession numbers pfam03142 and COG1216, respectively).

TABLE 31 DNA Sequence of moeGT4. GTGACTTCTGAGCCCGCCGCCCCGGCCGTCCCGCACCCGCCGGTGCGTCC GGGGCCGCCGGTCCGTCTCAACCGGCCGCTGGCGCGGCGCAGGCGGCGGC CGGCCGGGGAGGGGTTCGTGACGCACCACCTGCGGAGCACCATGGCCCGC GGGTTCCGCCCCCCGGAGTCCTGGGAGGTCCCCGTCCGGCACGTCCTGCC CGGTCTGCCGGCCGACGGGACTCCGCGCGCCGAGGAGGCCGCTCAGGCGC TGCGCACGCCCGCCGGGCGGCCGGGCATCGCCCTCGTCGTGCCGACCTAC GTCTCCCGGGTGAGCCTGGCGCGGCAGCGGGAGTGGTTCGACGCGCTGCT GGACCAGGCGGCCGCGGTGACGCGGGACCACCCCCTGGTGCCCCTGGTGC TGTTCGTCGGCATGCAGTGGTCGTCGGCCGAGGAGGAGCGGGAGGCGCTG CGGCGCCTGCGTGTGCTGCTGGACGACGCCCGCACCCGGCTGCCCGGACT GCGGATCTGCGGTCTCGCGCTGCCCGGGCCGGGCAAACCCCGCACCCTCA ACGGGGCGATCGCCGTCGCCGAGCTCCTCGGCTGTGCGGGCGTCGGGTGG ACCGACGACGACGTGACCCTGGAGGAGGACTGCCTGTCCCGGCTGGTGCG GGACTTCCTGGCGGCGGGCTGCCGCGGGGCGGTGGGCGCGACCAAGGTTG CGCACACCCATGAGTACGCCACCTCCCGGCTGCTGTCCCGGGCCAAGGCG ATCGCCGCCCCGGCCACGAACTACCCGCACGGCTGCTGCATCCTGGTGGC CACCGACGTGGTGGCCGGTGGTCTGCCGGGACGCTACGTATCCGACGACG GCTACGTGTGCTTCCGCCTCCTCGACCCCGCGCTGCCCGACCCGCTGGCC CGGCTGCGGCTGGTTCCGGACGCCCGGTGCCACTACTACGTGGCGGGGCC GGCCGGCGAGACCCGCCGCAGGATCCGCAGGCTGCTGCTCAACCACCTCG TCGACCTCGCCGACTGGCCCCTGCCGGTGGTCCGTCACTACTTCCGCCAC GTCCTGTTCGGCGGCATGTGGCCGCTGACCGGCTTCGACTCCTCCCGCGG TGCCCGCCGCGGTGTGCAGAAGGCGCTCATCAAGTGGCTCTACTTCGCCT GGTTCGCGGGCATCGGGGGCGAACTCTACGTGCGCGGGCTGTCCGGCAGG CCACTGCGCCGCATCGAGTGGGCTCCCTACTCGGACATCCGCAGGCTCAC TCCGTCGTCCTCACCCACGCGTCAGGAGAGCTGA (SEQ ID NO: 11)

TABLE 32 Amino Acid Sequence of moeGT4 VTSEPAAPAVPHPPVRPGPPVRLNRPLARRRRRPAGEFVTHHLRSTMARG FRPPESWEVPVRHVLPGLPADGTPRAEEAAQALRTPAGRPGIALVVPTYV SRVSLARQREWFDALLDQAAAVTRDHPLVPLVLFVGMQWSSAEEEREALR RLRVLLDDARTRLPGLRICGLALPGPGKPRTLNGAIAVAELLGCAGVGWT DDDVTLEEDCLSRLVRDFLAAGCRGAVGATKVAHTHEYATSRLLSRAKAI AAPATNYPHGCCILVATDVVAGGLPGRYVSDDGYVCFRLLDPALPDPLAR LRLVPDARCHYYVAGPAGETRRRIRRLLLNHLVDLADWPLPVVRHYFRHV LFGGMWPLTGFDSSRGARRGVQKALIKWLYFAWFAGIGGELYVRGLSGRP LRRIEWAPYSDIRRLTPSSSPTRQES (SEQ ID NO: 34

TABLE 33 Sequence Homology of moeGT4 gb|EAS23724.1| Glycosyl transferase, family 2 [Mycobacterium vanbaalenii PYR-1] Length = 426 Score = 35.8 bits (81), Expect = 3.9 Identities = 49/181 (27%), Positives = 69/181 (38%), Gaps = 29/181 (16%) SEQ ID NO: 143 Query 42 HHLRSTMARGFRPPESWEVPVRHVLPGLPADGTPRAEEAAQALRTPAGRPGIALVVPTYV 101 || |  +    +|    |||||  |  |||   |   +| |  |          ++|  | Sbjct 52 HHARVLVRHQGQPVAFVEVPVRDALIRLPACPLPEDLDAGQPAR----------LMPISV 101 SEQ ID NO: 244                                 SEQ ID NO: 144 Query 102 SRVSLARQREWFDALLDQAAAVTRDHPLVPLVLFVGMQWSSAEEEREALRRLRVLLDDAR 161    +  |  +  |||    + ++ |+|   +|+             +|   +   | ||| Sbjct 102 VLCTRDRPDQLADAL---KSILSLDYPEFEVVVV------DNAARTDATAGVVAQLGDAR 152                   SEQ ID NO: 145        SEQ ID NO: 146 Query 162 TRLPGLRICGLALPGPGKPRTLNGAIAVAELLGCAGVGWTDDDVTLEEDCLSRLVRDFLA 221  |       +| | ||     |  +  |       | +||||| ++   |  | | | Sbjct 153 VR-------RVAEPIPGLSTARNTGLRHA---AHPVVAFTDDDVVVDRQWLRGLARGFAR 202          SEQ ID NO: 147         SEQ ID NO: 148 Query 222 A 222 | Sbjct 203 A 203

Based on the studies of the present invention, it is propose that the polypeptide encoded by the moeGT4 gene is a glycosyltransferase that attaches sugar moiety (e.g., N-acetylglucosamine (GlcNac)) to the Moe intermediate compound 2 or 3 in the course of moenomycin biosynthesis to yield a Moe intermediate compound 4 as shown in FIGS. 4A-4D.

The 312 amino acid protein encoded by the moeGT5 gene is homologous to the central part of moeGT4 (45% identity and 59.1% similarity). HHpred results suggested that an incomplete COG1216 domain is also present in moeGT5. The nucleotide and polypeptide sequences of moeGT5 are shown in Tables 34 and 35, respectively.

TABLE 34 DNA Sequence of moeGT5 GTGCTGCGCCGTCTGGCCGAGGTGCGGGAAGCGCACCCGTCCCTGCCGCT GACCGTCTGGGTGGGCATGCAGTACGGCCCCGGGGAGGACGAGGAGGCGC TGCGCAGGCTGCGCCGGCTGTGCGCCCCGGTGCCCGGGGGCCCGGCCCTC ACCGTGGTCGGCCTGGCCCTGCCCGGGCCGGGCAAGCTCCGCACGGTGAG CACGGTCCTGCGGCTCTCCGAGGACCTCGGCTACGCCGGCTGGCTCTGGA CGGACGACGACATCGAGATCGCCCCCCACTGCCTCGCCCTGCTGGTCTCC CGTTTCCGGGAGCGGGGGGAGCGGGGCGCGGTCGGGGCGCATTCGGTCGC GCTGGCCAGGGAGACGGTCACCTCACAGGCCATGGACCGGGTCTCCGGGG TCACCGCCCCGCCGAAGGCCTGCCCGGCGGCGGCCTGCCTGGTCGTCGCG ACGGACGTGCTGGGCACCGGCATTCCGGTCAGGCGCCTGACCGACGACGG GTACGTGGTGTTCGAACTGCTCGACGCCGGGGCGCCCGATCCGCTGCACG ACCTGGAGGTGCTGCCCGAGGCCCGGATCAGCTTCTACCGCGTCAGCCGC ACCCACGACACGTTCCAGCGCCTGCGCCGCTCCCTCTACAGCCATGTGAC CTGCGTCGCCGACTATCCCTGGCCCACCGCGCGGGTCTACCTCACCCGGG TCCTCTTCCACGGTCTGTGGCCGCTCGCGGCGTGGGACGGCAGCCGGGGG CCGGTGCACGGGCTGCAGCGCTGGCTGGTCAAGGGCCTGCACTTCACCTG GTTCTGCGGGGTGGCCGGCTCGCTGGCGGTCCGGGGCGCGGTGGGACGGC CCCTTCGCCGGGTGGCGTGGGGCGACGAGGGGGACTTCCGCAGCCCCACC GTCGAGGAGCCCGCCGCGGGAGCGGCCGCCGGGCGCTGA (SEQ ID NO: 12)

TABLE 35 Amino Acid Sequence of moeGT5 VLRRLAEVREAHPSLPLTVWVGMQYGPGEDEEALRRLRRLCAPVPGGPAL TVVGLALPGPGKLRTVSTVLRLSEDLGYAGWLWTDDDIEIAPHCLALLVS RFRERGERGAVGAHSVALARETVTSQAMDRVSGVTAPPKACPAAACLVVA TDVLGTGIPVRRLTDDGYVVFELLDAGAPDPLHDLEVLPEARISFYRVSR THDTFQRLRRSLYSHVTCVADYPWPTARVYLTRVLFHGLWPLAAWDGSRG PVHGLQRWLVKGLHFTWFCGVAGSLAVRGAVGRPLRRVAWGDEGDFRSPT VEEPAAGAAAGR (SEQ ID NO: 35)

Based on the studies of the present invention, it is propose that the polypeptide encoded by the moeGT5 gene is a glycosyltransferase that attaches a sugar moiety (e.g., N-acetylglucosamine) (GlcNac)) or chinovosamine (Ch) moiety to the Moe intermediate compound 4 in the course of moenomycin biosynthesis to yield a Moe intermediate compound 6 or 7, respectively, as shown in FIGS. 4A-4D. Further, based on the studies of the present invention, it is propose that the polypeptide encoded by the moeGT5 gene is a glycosyltransferase that attaches a sugar moiety (e.g., N-acetylglucosamine) (GlcNac)) or chinovosamine (Ch) moiety to the Moe intermediate compound 11 in the course of moenomycin biosynthesis to yield a Moe intermediate compound 14 or 15, respectively, as shown in FIGS. 4A-4D.

C. Sugar Tailoring Genes

Seven genes were identified in cluster 1 that fit this activity profile: moeF5, moeH5, moeK5, moeM5, moeS5, moeR5 and moeE5.

The moeF5 and moeH5 genes share significant homology at the nucleotide sequence level, suggesting that the pair arose via a gene duplication event. The proteins encoded by these genes resemble a large family of ATP-dependent amidotransferases that form amides from carboxylic acids, but neither protein appeared fully functional based on sequence analysis. Therefore, we previously speculated that carboxamidation of unit F resulted from the activity of a MoeFSMoeHS heterodimer, with MoeFS generating ammonia from glutamine in an ATP-dependent manner and MoeH5 acting as an amidotransferase (Ostash 2007).

1. moeF5

The product of moeF5 gene resembles putative and known asparagine synthase B related enzymes from various bacteria (36% identity and 46% similarity). The nucleotide and polypeptide sequences are shown in Tables 36 and 37, respectively. A sequence alignment between moeF5 and its closest homolog is shown in Table 38.

TABLE 36 DNA Sequence of moeF5. ATGTGCGGCTTCGTCGGATTCAGTGACGCCGGCGCCGGGCAGGAGGACGC CCGTGTCACGGCCGAGCGCATGCTCGCCGCCGTGGCGCACCGCGGCCCCG ACGGCTCGGACTGGTGCCACCACCGGGGCGTCACCCTCGCGCACTGCGCC CTGACCTTCACCGATCCGGACCACGGCGCGCAGCCGTTCGTCTCCGCGTC GGGAGCCACCGCCGTGGTGTTCAACGGCGAGCTCTACAACCACGCCGTGC TGGGCGACGGGGCGTTGCCCTGCGCACCCGGAGGCGACACAGAAGTTCCT GGTGGAACTCTACGAGTTGCTGGGCATGCGGATGCTCGACCGGCTGCGGG GCATGTTCGCCTTCGCGCTGCAGGACGCCCGCACCGGCACCACGGTGCTG GCCGCGACCGATGGGGAAGAGCCCCTCTACTAACACCCGCGTGCGAGACG GACATCGCTTTCGCGTCGGAACTCACGTCTCTGCTGCGGCACCCCGCCGC GCCGCGCACACCGGAGGTGCGGGCGCTCGCCGACTACCTGGTGCTCCAGG CGTTCTGCGCCCCCGCCTCGGCCGTGTCGGGGGTGTGCAAGGTGCGCCCC GGCAGCTACGTGACCCACCGGCACGGCGCGTTGGACGAGACCGAGTTCTG GCGGCCCCGCCTGACCCCCGACCGGGGGGCGGGCCGCGGCCCCGGACGGC GGGAGGCCGCGCGGCGGTTCGAGGAGCTCTTCCGCGCCGCGGTCGCCCGC CGGATGACCAGCACCGACCGCCGCCTCGGCGTACTGCTCAGCGGCGGCCT GGACTCCAGCGCGGTCGCCGCGGTGGCCCAGCAGCTCCTGCCGGGACGGC CGGTGCCCACCTTCAGCGCGGGGTTCGCGGACCCGGACTTCGACGAGAGC GACCACGCACGGGCGGTGGCGCGCCACCTCGGCACCGAGCACCATGTGGT GCGGATCGGCGGGGCCGACCTCGCCGGTGTGGTGGAGTCCGAACTCGCCG TGGCCGACGAGCCGTTGGCCGATCCCTCCCTGCTGCCCACACGTCTGGTC TGCCGGGCGGCGCGCGAGCACGTCCGCGGCGTGCTCACCGGTGACGGCGC GGACGAACTGCTCCTGGGCTACCGCTACTTCCAGGCCGAGCGGGCGATCG AGCTGCTGCTGCGCGTGCTGCCGGCCCCCCGGCTGGAGGCCCTCGTCCGG CTGCTGGTGCGCCGGCTGCCGGCCCGTTCCGGCAACCTCCCCGTGACCCA CGCCCTCGGTCTGCTGGCCAAGGGCCTGCGCGCGGCACCGGAGCACCGGT TCTACCTCTCGACGGCGCCCTTCGGCCCGGGCGAGCTGCCACGGCTGCTC ACCCCCGAGGCCGGGGCCGAACTGACCGGGCACGACCCGTTCACCGAGGT GTCGCGCCTCCTGCGGGGACAGCCGGGCCTGACCGGTGTCCAGCGCAGCC AGCTCGCCGTGGTGACCCACTTCCTGCGGGACGTGATCCTCACCAAGACG GACCGGGGCGGCATGCGCAGCTCCCTCGAGCTGCGTTCCCCCTTTCTCGA CCTGGACCTGGTCGAGTACGGCAACTCCCTGCCCACCGGCCTGAAGCTGC ACCGGTTCACCGGCAAGTACCTGCTGCGGCAGGTCGCCGCCGGCTGGCTG CCCCCTTCCGTCGTCCAGCGGACGAAGCTGGGTTTCCGCGCGCCGGTGGC GGCCCTGCTCCGCGGCGAGCTGCGGCCCCTGCTCCTGGACACCCTCTCCC CGTCGTCCCTGCGCCGCGGCGGCCTGTTCGACACCGGGGCGGTGCGCCTG CTGATCGACGACCACCTCGGCGGCCGGCGCGACACCTCCCGCAAGCTGTG GGCGCTGCTGGTCTACCAGCTCTGGTTCGAGAGCCTGACGGCCGGACCCC GCGCCCTCGAGTCCCCCGCGTACCCGGCCCTCTCCTAG (SEQ ID NO: 13)

TABLE 37 Amino Acid Sequence of moeF5 MCGFVGFSDAGAGQEDARVTAERMLAAVAHRGPDGSDWCHHRGVTLAHCA LTFTDPDHGAQPFVSASGATAVVFNGELYNHAVLGDGALPCAPGGDTEVP GGTLRVAGHADARPAAGHVRLRAAGRPHRHHGAGRDRWGRAPLLTPACET DIAFASELTSLLRHPAAPRTPEVRALADYLVLQAFCAPASAVSGVCKVRP GSYVTHRHGALDETEFWRPRLTPDRGAGRGPGRREAARRFEELFRAAVAR RMTSTDRRLGVLLSGGLDSSAVAAVAQQLLPGRPVPTFSAGFADPDFDES DHARAVARHLGTEHHVVRIGGADLAGVVESELAVADEPLADPSLLPTRLV CRAAREHVRGVLTGDGADELLLGYRYFQAERAIELLLRVLPAPRLEALVR LLVRRLPARSGNLPVTHALGLLAKGLRAAPEHRFYLSTAPFGPGELPRLL TPEAGAELTGHDPFTEVSRLLRGQPGLTGVQRSQLAVVTHFLRDVILTKT DRGGMRSSLELRSPFLDLDLVEYGNSLPTGLKLHRFTGKYLLRQVAAGWL PPSVVQRTKLGFRAPVAALLRGELRPLLLDTLSPSSLRRGGLFDTGAVRL LIDDHLGGRRDTSRKLWALLVYQLWFESLTAGPRALESPAYPALS (SEQ ID NO: 36)

TABLE 38 Sequence Homology of moeF5 gi|20560076|gb|AAM27821.1| ORF_10; similar to Asparagine synthase [Pseudomonas aeruginosa] gi|6690135|gb|AAF24002.1| WbpS [Pseudomonas aeruginosa] Length = 627 Score = 198 bits (503), Expect = 1e−48 Identities = 193/645 (29%), Positives = 279/645 (43%), Gaps = 35/645 (5%) Frame = +1 SEQ ID NO:149  SEQ ID NO:150                   SEQ ID NO:151 Query 286     MCGFVGFSD-AGAGQEDARVTAERMLAAVAHRGPDGSDWCHH-RGVTLAHCALTFTD-P 453     |||  || +  |    |    | +| ||+ ||||| |   +   |   | |  |   + Sbjct 1     MCGIAGFWNITGTLLGDNARVARQMAAAIHHRGPDESGIWYEAPRAPILVHARLAVLELS 60     SEQ ID NO: 152     SEQ ID NO: 245            SEQ ID NO:153   SEQ ID NO: 154 Query 454     DHGAQPFVSASGATAVVFNGELYNH----AVLGDGALPCA--PGGDTEVPGGTLRVAG-H 612       |+||  |  |   +++|||+|||    | | +  +  +   | |||         | Sbjct 61     PAGSQPMHSDCGRYVLIYNGEIYNHLALRARLSEAGVTHSWRGGSDTETLLACFAQWGVE 120     SEQ ID NO: 155 Query 613     ADARPAAGHVRLRAAGRPHRHHGAGRDRWGRAPLLTPACETDIAFASELTSLLRHPAAPR 792     +  +   |   |    |  +     ||| |  ||        + ||||| +|  || Sbjct 121     STLKLTVGMFALALWDRQEKTITLARDRMGEKPLYWGWQNGVLFFASELKALKEHPLFRG 180                                                      SEQ ID NO: 156 Query 793     TPEVRALADYLVLQAFCAPASAVSGVCKVRPGSYVTHRHGALDET----EFWRPRLTPDr 960       +  ||| +|      || |   |+ |+| |||+     +|+||     +|      + Sbjct 181     DIDRDALALFLRYGYVPAPYSIYKGIGKLRAGSYLVLSERSLNETCEPAAYWSANAAIEE 240 Query 961     gagrgpgrreaarrfeelfraavarrMTSTDRRlgvllsggldssavaavaQQLLPGRPV 1140              +|        +   +                |   |+      |    ||+ Sbjct 241     ALSNPFQGTDAEAVDLLESQLRTSISDQMVSDVPLGAFLSGGVDSSTVVALMQQQSSRPI 300 Query 1141     PTFSAGFADPDFDESDHARAVARHLGTEHHVVRIGGADLAGVVESELAVADEPLADPSLL 1320      ||| || +| +||+ +|+||| |+||+|  + +   |   |+ |   +  ||  | | + Sbjct 301     RTFSIGFDEPGYDEAVYAKAVAEHIGTDHTELYVNSKDALDVIPSLPKIYCEPFGDSSQI 360                                       SEQ ID NO: 157 Query 1321     PTRLVCRAAREHVRGVLTGDGADELLLGYRYFQ-AERAIELLLRVlpaprlealvrllvr 1497     || +|   ||+ |   |+||| |||  ||  +|   |   +| |   + |  | Sbjct 361     PTLIVSGLARQQVTVALSGDGGDELFGGYNPYQFTPRVWRMLERFPHSMRRFASAFAQDL 420 Query 1498     rlparSGNLPVTHalgllakglraaPEHRFYLSTAPFGPGELPRLLTPEAGAELTGHDPF 1677      || + | |                 |  ||   + +   | | +     ||+  || Sbjct 421     PLPEKLGKL--------RDVFASRTAEELFYRLNSHWRNHEYPVI-----GAQ-GHTAL 465                      SEQ ID NOS: 158                       159                               SEQ ID NO: 254 Query 1678     TEVSRLLRGQPGLTGVQRSQLAV-VTHFLRDVILTKTDRGGMRSSLELRSPFLDLDLVEY 1854      +        | +   |   +|+ |  ++ | || | ||  | +||| | | +|  + | Sbjct 466     LDTPERW---PRVDSFQHWMMAMDVQGYMPDDILVKVDRAAMANSLETRVPLIDHRVFEL 522               SEQ ID NO: 160 Query 1855     GNSLPTGLKLHRFTGKYLLRQVAAGWLPPSVVQRTKLGFRAPVAAllrgelrpllldtls 2034        +|  +|+    ||+|||+|    +   +++| | ||  ||+  ||| |+      | Sbjct 523     AWRMPLHMKIRNGKGKWLLREVLYRHVSRELIERPKKGFSVPVSDWLRGPLKEWAESLLD 582 Query 2035     psslrrGGLFDTGAVRLLIDDHLGGRRDTSRKLWALLVYQLWFES 2169        |++ |  |+  +| + +||| |||| ||+||++|++| | || Sbjct 583     ERRLOQEGYLDSRLIRRIWNDHLAGRRDHSRRLWSVLMFQAWLES 627

These synthases belong to the huge glutamine amidotransferase family whose members catalyze ATP-dependent amide nitrogen transfer from glutamine to acceptor substrates in different biosynthetic pathways (Zalkin 1998). Particularly, moeF5 appears similar to the WbpS proteins from Pseudomonas aeruginosa O4 and Shigella dysenteriae type 7 which are encoded by genes grouped in clusters for the biosynthesis of O antigens (29% identity and 43% similarity) (Feng 2004, Belanger 1999). The WbpS proteins appear to be responsible for carboxyl-amidation of deoxysugar moieties (Knirel 1988) during antigen biosynthesis in the aforementioned strains. A moeF5 CDD search revealed the presence of a glutaminase domain (AsnB; cd00712) and an interrupted asparagine synthase domain (Asn synthase BC; cd01991) in the N- and C-termini, respectively.

Strain ΔmoeF5 accumulated compound 1 (Table 21), which has a mass 1 Da higher than that of the monosaccharide moe A precursor 2 accumulated by the ΔmoeGT4 strain, consistent with the presence of a carboxyl moiety in unit F of 1 instead of the carboxamide group in 2/3 (FIGS. 4A-D). Our data agree with the prediction that the moeF5 gene is involved in F ring carboxamidation. We could not detect the formation of methylated monosaccharide precursors or of any larger moe A intermediates, implying that the absence of the carboxamide moiety abolishes unit F methylation and subsequent glycosylations. Therefore, MoeFS-catalyzed carboxamidation occurs prior to, and is required for, other modifications of 1 (FIGS. 4A-4D). That is, based on the studies of the present invention, it is propose that the polypeptide encoded by the moeF5 gene is a Unit F amidotransferase which participates in the conversion of Moe intermediate compound 1 in the course of moenomycin biosynthesis to yield a Moe intermediate compound 2 or 3, respectively, as shown in FIGS. 4A-4D.

2. moeH5

The moeF5 translation product also displays local homology (27.1% identity and 34% similarity) to moeH5, another AsnB-like protein. The nucleotide and polypeptide sequences are shown in Tables 39 and 40, respectively. A sequence alignment between moeH5 and the closest homolog identified in the BLAST search is shown in Table 41. MoeH5 shows 32% identity, and 48% similarity to a putative amidotransferase of Azoarcus sp. EbN1. MoeH5 also possesses a truncated asparaginase domain and an entire amidotransferase domain.

TABLE 39 DNA Sequence of moeH5. ATGACGGTCCGCCGCCCGGCCGCGTCCGCCCCCCGCGTCCTCCTGACCGC GGGCCCCGACGGGGTGCGCGTGGAGGGCGACGGGGAGGCGCGCCTCGGGC ACCCCCTCACCGGTGACCACCTGGACCCGGGCCCGCCGGCCGAAGGCGTC TTCGCCGGGTGGAGGTGGGACGGCGAGCGCCTGGTGGCCCGCAACGACCG CTACGGCGTCTGCCCCCTCTTCTACCGGGCCGGCGGCGGCTCACTCGCGC TCTCCCCCGACCCGCTCGCCCTGCTGCCGGAGGACGGGCCCGTCGAGCTG GACCACGACGCGCTCGCCGTCTTCCTGCGGACGGGGTTCTTCCTCGCCGA GGACACGGCCTTCGCACAGGTCCGCGCACTGCCCCCGGCGGCCACGCTCA CCTGGGACACCGGCGGGCTGCGGCTGCGGTCCGACGGGCCGCCGCGCCCC GGGGCCGCCGCGATGACCGAGGCGCAGGCGGTCGACGGCTTCGTCGACCT GTTCCGCGCCTCGGTGGCCCGCCGGCTGCCCGGCGAACCGTACGACCTGC CGCTCAGCGGCGGCCGGGACTCGCGGCACATCCTGCTCGAGCTGTGCCGC CGCGGCGCACCGCCGCGGCGGTGCGTCAGCGGCGCCAAGTTCCCTCCCGA CCCGGGGGCCGACGCGCGCGTGGCGGCCGCCCTGGCGGGCCGGCTCGGTC TGCCGCACACGGTGGTGCCGCGCCCCCGTTCGCAGTTCCGCGCGGAGCTC GCCGCCCTGCCGGCCCAGGGCATGACCACCCTGGACGGCGCGTGGACCCA GCCGGTCCTGGCCCACCTGCGCCGCCACAGCCGCATCTCGTACGACGGTC TCGGCGGCGGGGAGCTCGTCCAGAACCCGAGCGTGGAGTTCATCCGGGCC AACCCCTACGACCCCGCGGACCTGCCCGGCCTGGCGGACCGGTTGCTGGC CGCGAGCCGGACCGGCCCCCACGTGGAGCACCTGCTGAGCCCCCGGACGA ACGCCCTGTGGAGCAGGCAGGCGGCGCGGCGGCGCCTCGTCACCGAGCTG GCCCGGCACGCCGACAGCGCCAGCCCGCTCAGTTCCTTCTTCTTCTGGAA CCGGACCCGGCGCTCCATCTCCGCGGCTCCGTTCGCCCTGGGGGACGGAC GGGTCCTGACGCACACCCCCTACCTCGACCACGCCCTCTTCGACCACCTC GCCTCGGTGCCGCACCGCTTCCTGGTCGACGGGACGTTCCACGACCGGGC GCTGCACCGGGCCTTCCCCGAGCACGCGGACCTGGGGTTCGCCTCGTCGG TGCCCCAGCGGCACGGACCCGTGCTGGTCGCGCACCGAGTGGCGTACCTG CTCCGGTTCCTCGCCCACGCGACGGTCGTGGAACCGGGCTGGTGGCGCGG CCCCGACCGCTTCCTGCAACGGCTGCTGGCCGCCGGCCGGGGGCCCGGGG CCCCGCAGCGCGTCAGCAGGCTGCAGCCCCTGGCGCTCTACCTGCTGCAG TTGGAGGACCTCGCCGTCCGAAGGGCCCGCCGCCGGCCGTAG (SEQ ID NO: 14)

TABLE 40 Amino Acid Sequence of moeH5 MTVRRPAASAPRVLLTAGPDGVRVEGDGEARLGHPLTGDHLDPGPPAEGV FAGWRWDGERLVARNDRYGVCPLFYRAGGGSLALSPDPLALLPEDGPVEL DHDALAVFLRTGFFLAEDTAFAQVRALPPAATLTWDTGGLRLRSDGPPRP GAAAMTEAQAVDGFVDLFRASVARRLPGEPYDLPLSGGRDSRHILLELCR RGAPPRRCVSGAKFPPDPGADARVAAALAGRLGLPHTVVPRPRSQFRAEL AALPAQGMTTLDGAWTQPVLAHLRRHSRISYDGLGGGELVQNPSVEFIRA NPYDPADLPGLADRLLAASRTGPHVEHLLSPRTNALWSRQAARRRLVTEL ARHADSASPLSSFFFWNRTRRSISAAPFALGDGRVLTHTPYLDHALFDHL ASVPHRFLVDGTFHDRALHRAFPEHADLGFASSVPQRHGPVLVAHRLAYL LRFLAHATVVEPGWWRGPDRFLQRLLAAGRGPGAPQRVSRLQPLALYLLQ LEDLAVRRARRRP (SEQ ID NO: 37)

TABLE 41 Sequence Homology of moeH5 ref|YP_159440.1| amidotransferase, similar to asparagine synthase (glutmine- hydrolyzing) [Azoarcus sp, EbN1] Length-642 Score = 70.1 bits (170), Expect = 2e−10 Identities = 54/165 (32%), Positives = 80/165 (48%), Gaps = 5/165 (3%) SEQ ID NO: 161  SEQ ID NO: 162                           SEQ ID NO: 163 Query 48    EGVFAGWRWDG--ERLVARNDRYGVCPLFYRAGGGSLALSPDPLALLPEDGPV-ELDHDA 104    +|+|    ||   +||+   |  || ||+       || + +  |||   |   ||||| Sbjct 121    DGMFNFALWDARRKRLLIGRDPLGVKPLYVHRSASMLAFATEAKALLELPGVTRELDHDV 180    SEQ ID NO: 164                                          SEQ ID NO: 165 Query 105    LAVFLRTGFFLAEDTAFAQVRALPPAATLTWDTGGLR-LRSDGPPRPGAAAMTEAQAVDG 163    +| +|  |+  |  + |  +| ||||  |+ + | +|  |    |   |  +|||+ + Sbjct 181    VADYLHLGYVAAPHSMFRDIRKLPPATLLSVENGEVRQWRYWRLPSSVARYVTEAEWIGR 240                       SEQ ID NO: 246 Query 164    FVDLFRASVARRLPGE-PYDLPLSGGRDSRHILLELCRRGAPPRR 207      |   +|  |++  + |    |||| ||  ++  + +  | | | Sbjct 241    IRDGMERAVHRQMVSDVPIGAFLSGGVDSSAVVAFMAKHSAHPIR 285

Gene moeH5 controls the carboxamidation of unit B (see compound 19), since strain ΔmoeH5 accumulated the moe A precursor 17 (Table 21, FIGS. 4A-4D). Expression of moeR5 in the ΔmoeH5 strain leads to the accumulation of the previously described compound 16 (Zehl 2006), supporting the structure assignment for 17 (FIG. 4). Apparently, underexpression of moeH5 in producing strains leads to the accumulation of moenomycins having the acid form of unit B (compounds 16, 17, 22). Thus, despite having high sequence homology, moeF5 and moeH5 have been shown via gene disruption to play quite different roles in moe A biosynthesis, and they cannot substitute for one another functionally in cross-complementation experiments.

The results described above have showed that MoeH5 amidates the B ring carboxyl group, but they do not explain why this modification occurs. That is, based on the studies of the present invention, it is propose that the polypeptide encoded by the moeH5 gene is a Unit B amidotransferase which participates in the conversion of Moe intermediate compound 16 or 17 in the course of moenomycin biosynthesis to yield a Moe intermediate compound 18 or 19, respectively, as shown in FIGS. 4A-4D. The moeH5-controlled reaction could either be either a branch of moe A metabolism or an essential biosynthetic step prior to “decoration” of the moe A precursor with the unit A chromophore moiety. Unit A biogenesis was proposed to proceed via a MoeB4-catalyzed reaction between an amino cyclopentadione moiety and compound 16/17 (Ostash 2007). However, the presence of a gene dedicated to the conversion of the acid moe A precursor 16/17 into the amide precursor 18/19 (FIG. 4) raised questions about the proposed scheme. We coexpressed the genes for unit A biosynthesis (pOOB64b) in the 38-1+ recombinant strain, which directs the production of 19, and in its ΔmoeH5 derivative, which produces 17. Expression of pOOB64b in ΔmoeH5 yielded no new products (Table 21), while the pOOB64b⁺38-1⁺ strain produced the known compound pholipomycin 21 (Table 21, FIGS. 4A-4D), which contains the Unit A chromophore (Welzel 2005). The inability of pOOB64b⁺ΔmoeH5 strain to produce pholipomycin 21 implies that either 17 is not a precursor to 21 or the moeH5 is essential for moe A chromophore attachment for other reasons. (Schuricht 2000, Petricek 2006, Ostash 2007). At the moment, we propose that carboxamide 19 serves as a necessary intermediate to moe A (FIGS. 4A-4D).

3. moeK5

The putative protein encoded by moeK5 is homologous to radical SAM superfamily enzymes, particularly to a presumed methyltransferase from Pyrococcus horikoshii OT3 (34% identity and 52% similarity). The nucleotide and polypeptide sequences are shown in Tables 42 and 43, respectively. A sequence alignment between moeK5 and the closest homolog identified in the BLAST search is shown in Table 44. The MoeK5 putative translation product showed no apparent similarity to other known sugar C-methyltransferases, such as those exemplified by NovU and TylC3, which require keto group in a position adjacent to methylation site (Thuy 2005, Takahashi 2006). A CDD search revealed radical SAM vitamin B12 binding domain (cd02068) and radical SAM domain (pfam04055) in the N- and C-halves, respectively, of moeK5. Accordingly, moeK5 could function via a SAM radical mechanism, and may not require the transformation of a sugar into an anionic form before methylation.

TABLE 42 DNA Sequence of moeKS. CTGGGTTACATCCACACCGCGCTCAAGTCGGCCGGGTTCCACCACGTCAT CCAGGTCGACACCCCCGCCCTGGGCCTCGACAGCGAGGGGCTGCGCAAGC TGCTCGCGGACTTCGAGCCGGACCTGGTCGGGGTGAGCACCACGACACCC GGTCTGCCCGGCGCCATCGAGGCGTGCGAGGCGGCCAAGAGCACCGGGGC GAAGGTGATCCTGGGCGGGCCGCACACGGAGGTGTACGCGCACGAGAACC TGGTCCACGAGTCCATCGAGTACGTGGGCGTCGGCGAAGGCGTCACGATC ATGCCGGAACTGGCGGAGGCGATGGAGCGGGGCGAGGAGCCGGAGGGCAT CCGCGGCCTGGTGACCCGCAAGCACGACGGCGGTGCCGCGCCGATGGTGA ACCTGGAGGAGGTCGGCTGGCCCGAACGCGCCGGGCTCCCGATGGACCGC TACTACTCGATCATGGCTCCGCGGCCGTTCGCGACGATGATCTCCAGCCG CGGCTGCCCCTTCAAGTGCAGCTTCTGCTTCAAGCAGGCCGTGGACAAGA AGTCCATGTACCGCAGTCCCGAGGACGTCGTCGGTGAGATGACGGAGCTC AAGGAGCGGTGGGGGGTGAAGGAGATCATGTTCTACGACGACGTGTTCAC CCTGCACCGCGGCCGGGTGCGGGAGATCTGCGGGCTCATCGGGGAGACCG GCCTCAAGGTCCGCTGGGAGGCGCCCACCCGCGTCGACCTGGTGCCCGAG CCGCTGCTGGAGGCGATGGCCGGGGCCGGGTGCGTGCGCCTGCGGTTCGG CATCGAGCACGGTGACAGCGAGATCCTCGAGCGGATGCGCAAGGAGAGCG ACATCCAGAAGATCGAGAAGGCCGTCACCTCCGCCCACGAGGCCGGGATC AAGGGCTTCGGGTACTTCATCGTCGGCTGGCTCGGGGAGACCCGGGAGCA GTTCCGCAGGAGCGTCGAGCTCGCCTGCCGCCTCCCGCTGGAGTAGGCCA GCTTCTACACCGCGACGCCCCTGCCGGGCACCCCCCTGCACACGGAGTCC GTGGCCGCCGGCCAGATCCCGCCCGACTACTGGGACCGCTTTTCGTGCGG GGCGAGTTCGACGCGCGGATCGGGTACCTGGTGCCGGACGCGCAGGAGCG CGCCCAGTGGGCGTACCGCTCCTTCTTCATGCGCCGCTCCATGGTCAAGC CGCTGCTGTCGCACATGGCGGTGA (SEQ ID NO: 15)

TABLE 43 Amino Acid Sequence of moeK5 LGYIHTALKSAGFHHVIQVDTPALGLDSEGLRKLLADFEPDLVGVSTTTP GLPGAIEACEAAKSTGAKVILGGPHTEVYAHENLVHESIDYVGVGEGVTI MPELAEAMERGEEPEGIRGLVTRKHDGGAAPMVNLEEVGWPERAGLPMDR YYSIMAPRPFATMISSRGCPFKCSFCFKQAVDKKSMYRSPEDVVGEMTEL KERWGVKEIMFYDDVFTLHRGRVREICGLIGETGLKVRWEAPTRVDLVPE PLLEAMAGAGCVRLRFGIEHGDSEILERMRKESDIQKIEKAVTSAHEAGI KGFGYFIVGWLGETREQFRRTVDLACRLPLDYASFYTATPLPGTPLHTES VAAGQIPPDYWDRFSCGASSTRGSGTWCRTRRSAPSGRTAPSSCAAPWSS RCCRTWR (SEQ ID NO: 38)

TABLE 44 Sequence Homology of moeK5 ref|NP_142754.1| methyltransferase [Pyrococcus horikoshii OT3] Length = 459 Score = 192 bits (489), Expect = 2e−47 Identities = 128/375 (34%), Positives = 195/375 (52%), Gaps = 13/375 (3%) SEQ ID NO: 166 Query 1 LGYIHTALKSAGFHHVIQVDTPALGLDSEGLRKLLADFEPDLVGVSTTTPGLPGAIEACE 60 ||  + |      | |  +|  |  |    + |++  |+||+||++ ||  +  |    + Sbjct 28 LGLAYLASMVREEHDVKIIDGLAEDLTFSDIAKIIKKFDPDIVGITATTSAMYDAYTVAK 87 SEQ ID NO: 247            SEQ ID NO: 167       SEQ ID NO: 168 Query 61 AAKSTGAKV--ILGGPHTEVYAHENLVHES--IDYVGVGEGVTIMPELAEAMERGEEPEG 116  ||+    |  ++||||   +  |  + |   || |  |||     || +|+ +| | +| Sbjct 88 IAKNINENVFVVMGGPHV-TFTPELTMRECPCIDAVVRGEGELTFKELVDALSKGRELKG 146                    SEQ ID NO: 248              SEQ ID NO: 169 Query 117 IRGLVTRKH----DGGAAPMV-NLEEVGWPERAGLPMDRYYSIMAPRPFATMISSRGCPF 171 | ||  +++    +    |++ |++|+  |    ||||+| +   |  |  +++|||||| Sbjct 147 ILGLSYKENGKVRNEPPRPLIQNVDEIPIPSYDLLPMDKYKADGVP--FGVVMTSRGCPF 204                                                 SEQ ID NO: 170           SEQ ID NO: 171 Query 172 KCSFCFKQA-VDKKSMYRSPEDVVGEMTELKERWGVKEIMFYDDVFTLHRGRVREICGLI 230  | ||       |+    | | |+ |++ |   +|+||| | || |||++ |  +|   | Sbjct 205 NCVFCSSSLQFGKRWRGHSVERVIEELSILHYEYGIKEIEFLDDTFTLNKKRAIDISLRI 264 Query 231 GETGLKVRWEAPTRVDLVPEPLLEAMAGAGCVRLRFGIEHGDSEILERMRKESDIQKIEK 290  + || + | | +||+   | + +||   ||  + ||||     ||| + |    |+ Sbjct 265 KQEGLDISWTASSRVNTFNEKVAKAMKEGGCHTVYFGIESASPRILEFIGKGITPQQSID 324 Query 291 AVTSAHEAGIKGFGYFIVGWLGETREQFRRTVDLACRLPLDYASFYTATPLPGTPLHTES 350 || +| + |+   | ||+|+  ||||+   |+  | +| +||| |  ||| ||| |   + Sbjct 325 AVKTAKKFGLHALGSFIIGFPDETREEVEATIKFAKKLDIDYAQFTIATPYPGTRLWEYA 384 Query 351 VAAGQIPPDYWDRFS 365 +|   +    | +++ Sbjct 385 IANNLLLTMNWRKYT 399

Gene moeK5 encodes a protein homologous to putative SAM-radical, methyl-cobalamin-dependent methyl transferases involved in the biosynthesis of fortimycin and a handful of other secondary metabolites, and we have proposed that it controls the methylation of the first sugar (unit F) (Ostash 2007). Indeed, strain ΔmoeK5 accumulated a compound 24 having a mass 14 Da less than that of compound 19 from the parental 38-1⁺ strain (Table 21), indicative of the loss of a methyl group. Methylation of unit F most likely takes place after its attachment to farnesyl-phosphoglycerate since we detected a mixture of nonmethylated and methylated monosaccharides (compounds 2 and 3, respectively; Table 21) in the ΔmoeGT4 strain. Based on the studies of the present invention, it is propose that the polypeptide encoded by the moeK5 gene is a methyltransferase which participates in the conversion of Moe intermediate compound 1 in the course of moenomycin biosynthesis to yield a Moe intermediate compound 2 or 3, respectively, as shown in FIGS. 4A-4D.

4. moeM5

The predicted moeM5 translation product of moeM5 is similar to carbamoyltransferases from NodU family (33% identity and 45% similarity to putative carbamoyltransferase from Rubrobacter xylanophilus DSM9941) (Jabbouri 1995) as well as to those involved in antibiotic biosynthesis in various actinomycetes (29% identity and 44% similarity to GdmN involved in geldanamycin biosynthesis) (Hong 2004). The nucleotide and polypeptide sequences are shown in Tables 45 and 46, respectively. A sequence alignment between moeM5 and the closest homolog identified in the BLAST search is shown in Table 47. MoeK5 and moeM5 may govern the transfer of methyl and carbamoyl groups, respectively, on a moenuronamide precursor.

TABLE 45 DNA Sequence of moeM5. ATGAAGGTACTGTCGCTCCACTCCGCCGGCCACGACACCGGCGTCGCCTA CTTCGAGGACGGGCGGCTGGTCTTCGCGGTCGAGACCGAACGGCTCACCC GGGTCAAGCACGACCACCGCTCCGACGTCGCCCTGCGGCACGTGCTCGAG CAGGAGTGCGTGGACACCGACGGGATCGACCTGGTGGCCGTCAGCACCCC GGTCCGCAGCGGGCTGCTGCGCATACCCGACCTGGACCGGGCCATGGAGC GGATCGGGGCGGGCGCCCTCCACCACCGGACCGTCTGCGAGATGCTGGGG CGGCGGGTGGAGTGCGTCGTGGTCACCCACGAGGTCTCCCACGCGGCGCT GGCCGCCCACTACGCGGACTGGGAGGAAGGCACCGTCGTCCTCGTCAACG AGGGCCGCGGCCAGCTCACCCGCAGCTCCCTGTTCCGGGTGACCGGCGGG GCCCTGGAGTGGGTCGACAAGGACCCGCTGCCCTGGTACGGCAACGGCTT CGGGTGGACGGCGATCGGGTACCTCCTCGGCTTCGGCCCGAGCCCCAGCG TGGCGGGCAAGGTGATGGCCATGGGCGGCTACGGGCAGCCGGACCCGCGC ATCCGCGAACAGCTGCTGTCGGTGGATCCGGAGGTGATGAACGACCGGGA ACTCGCCGAGCGGGTGCGCGCGGACCTGGCCGGCCGGCCCGAGTTCGCCC CCGGGTTCGAGACGGCGTCGCAGGTGGTGGCGACGTTCCAGGAGATGTTC ACCGAGGCCGTCCGGGCGGTGCTCGACCGGCATGTGACGCGCACGGACGC CGGGGTGGGCCCGATCGCCCTGGGCGGCGGGTGCGCCCTGAACATCGTGG CCAACTCGGCGCTGCGGGAGGAGTACGGGCGGGACGTCGCCATCCCGCCC GCCTGCGGGGACGCGGGTCACCTGACGGGCGCCGGCCTCTACGCCCTCGC GCAGGTGGCCGGGGTGAAGCCGGAGCCGTTCAGCGTGTACCGCAACGGCG GGGGCGAGGCCCGGGCCGCCGTCCTGGAGGCGGTGGAGGGCGCGGGGTTG CGGGCCGTTCCCTACGACCGGTCCGCGGTCGCCGGGGTGCTGGCCGGGGG CGGGGTGGTGGCGCTGACGCAGGGAGCGGCGGAACTGGGGCCGCGGGCGC TGGGGCACCGGTCGCTGCTGGGCAGTCCCGCGGTGCCGGGCATGCGCGAG CGGATGAGCGAGAAGCTCAAGCGGCGCGAGTGGTTCCGGCCGCTGGGCGC CGTGATGCGCGACGAGCGCTTCGCCGGGCTGTACCCGGGGCGGGCGCCGT CGCCGTACATGCTCTTCGAGTACCGGCTGCCGGACGGGATCGCGCCCGAG GCCCGGCACGTCAACGGCACCTGCCGGATCCAGACCCTGGGCCCCGAGGA GGACCGGCTGTACGGTCTGCTCGCCGAGTTCGAGGAGCTGAGCGGTGTGC CGGCGCTGATCAACACGTCGCTCAACGGCCCGGGCAAGCCCATCGCGCAC ACCGCCCGGGACGTGCTCGACGACTTCGCGCGCACCGACGTCGACCTCTT CGTGTTCGACGACCTGATGGTGCGGGGCGCCGCCGCGCGGTAG (SEQ ID NO: 16)

TABLE 46 Amino Acid Sequence of moeM5 MKVLSLHSAGHDTGVAYFEDGRLVFAVETERLTRVKHDHRSDVALRHVLE QECVDTDGIDLVAVSTPVRSGLLRIPDLDRAMERIGAGALHHRTVCEMLG RRVECVVVTHEVSHAALAAHYADWEEGTVVLVNEGRGQLTRSSLFRVTGG ALEWVDKDPLPWYGNGFGWTAIGYLLGFGPSPSVAGKVMAMGGYGQPDPR IREQLLSVDPEVMNDRELAERVRADLAGRPEFAPGFETASQVVATFQEMF TEAVRAVLDRHVTRTDAGVGPIALGGGCALNIVANSALREEYGRDVAIPP ACGDAGHLTGAGLYALAQVAGVKPEPFSVYRNGGGEARAAVLEAVEGAGL RAVPYDRSAVAGVLAGGGVVALTQGAAELGPRALGHRSLLGSPAVPGMRE RMSEKLKRREWFRPLGAVMRDERFAGLYPGRAPSPYMLFEYRLPDGIAPE ARHVNGTCRIQTLGPEEDRLYGLLAEFEELSGVPALINTSLNGPGKPIAH TARDVLDDFARTDVDLFVFDDLMVRGAAAR (SEQ ID NO: 39)

TABLE 47 Sequence Homology of moeM5 gb|AAO06921.1|GdmN [Streptomyces hygroscopicus] Length = 682 Score = 159 bits (401), Expect = 4e−37 Identities = 167/557 (29%), Positives = 246/557 (44%), Gaps = 49/557 (8%)      SEQ ID NO: 172                    SEQ ID NO: 173 Query 11      HDTGVAYFEDGRLVFAVETERLTRVKHDHRSDV-ALRHVLEQECVDTDGIDLVAVSTP-- 67      ||+  +   || || ||| ||| |+|   +  + |+|  |       + +| |    | Sbjct 27      HDSAASLIRDGELVAAVEEERLNRIKKTTKFPLNAVRECLALAGARPEDVDAVGYYFPEN 86      SEQ ID NO: 174 SEQ ID NO: 175  SEQ ID NO: 235                SEQ ID NO: 176 Query 68      -VRSGLLRI-PDLDRAMERIGAGALHHRTVCEMLGRRV---ECVVVTHEVSHAALAAHYA 122       + + |  +  +  ||  |     +  | + | || +   + | | |  +||  +++ Sbjct 87      HIDTVLNHLYTEYPRAPLRYSRELIRQR-LKEGLGWDLPDEKLVYVPHHEAHA-YSSYLH 144                                   SEQ ID NO: 177           SEQ ID NO: 178 Query 123      DWEEGTVVLVNEGRGQLTRSSLFRVTGGALEWVDKDPLPWYGNGFGWTAIGYLLGFGPSP 182         +  +||| +|||+|   +++|  |  || +   |+|    |    |  ||||+| Sbjct 145      SGMDSALVLVLDGRGELHSGTVYRAEGTRLEKLADYPVPKSLGGLYLNAT-YLLGYGFGD 203                                                         SEQ ID NO: 236                                                         SEQ ID NO: 179 Query 183      SVAGKVMAMGGYGQPDPRIREQLLSVDPEVMNDRELAERVRADLAGRPEF-APGF----- 236          ||| +  +| |+            +   + ||   +       | | | || Sbjct 204      EY--KVMGLAPWGNPETYRDTFAKLYTLQDNGEYELHGNIMVPNLVSPLFYAEGFRPRRK 261          SEQ ID NO: 180       SEQ ID NO: 181                 SEQ ID NO: 182 Query 237      -ETASQVVATFQEMFTEAVRAVLDRHVTR---TDAGVGPIALGGGCALNIVANSA-LREE 291       |  +|    |     | |  ++  |+            +|   +  ||| | |   |   |+ Sbjct 262      GEPFTQAHRDFAAALQETVEKIV-LHILEYWAKTSGHSRLCFGGGVAHNSSLNGLILKSG 320                              SEQ ID NO: 237      SEQ ID NO: 183                                  SEQ ID NO: 184 Query 292      YGRDVAIPPACGDAGHLTGAGLYALAQVAGVKPEPFSVYRN-------GGGEARAAVLEA 344         +| + ||  |||   ||  || |   |    |    +       || |   | | Sbjct 321      LFDEVFVHPASHDAGAGEGAA-YAAAASLGTLERPGKRLLSASLGPALGGREQIRARL-- 377                            SEQ ID NO: 249                         SEQ ID NO: 185 Query 345      VEGAGLRAVPYDRSAV---AGVLAGGGVVALTQGAAELGPRALGHRSLLGSPAVPGMRER 401       + | |  | +   ||   ||+|| | |+    | +| |||||||||++        | | Sbjct 378      ADWAPLIDVEFPDDAVETAAGLLAEGQVLGWAYGRSEFGPRALGHRSIVADARPEENRTR 437      SEQ ID NO: 238                                      SEQ ID NO: 250 Query 402      MSEKLKRREWFRPLGAVMRDERFAGLYP-GRAPSPYMLFEYRLPDGIAPEAR-------H 453      ++  +|+|| |||   |+  |     +    |   +    + +|  + || |       | Sbjct 438      INAMVKKREGFRPFAPVVTAEAARDYFDLSGADGNHEFMSFVVP--VLPERRTELGAVTH 495                                                    SEQ ID NO: 186      SEQ ID NO: 187  SEQ ID NO: 188 Query 454      VNGTCRIQTLGPEE-DRLYGLLAEFEELSGVPALINTSLNGPGKPIAHTARDVLDDFART 512      |+|| |+| +  |  +| + |+  | ||+| | |+||| |  +||  +  ||+  |  | Sbjct 496      VDGTARVQVVSAESGERFHRLVRRFGELTGTPVLLNTSFNNNAEPIVQSLDDVVTSFLTT 555 Query 513      DVDLFVFDDLMVRGAAA 529      |+|+ | +| +||| |+ Sbjct 556      DLDVLVVEDCLVRGKAS 572

To evaluate whether sugar tailoring reactions (particularly, O-carbamoylation of unit F) follow the formation of the lipid-phosphoglycerate-pentasaccharide scaffold of moe A, the carbamoyltransferase gene moeM5 was disrupted (see e.g., FIGS. 9A-9B). The mutant strain, termed OB20a, was then evaluated for moe A function; extracts from the moeMY mutant should contain less active moe A derivatives, lacking a carbomoyl group. Indeed, moeM5 deficient mutants have been shown to produce novel moe compounds with greatly reduced antibacterial activity. The molecular mass of such compounds (m/z 1538 Da; see e.g., FIGS. 12A-12D and 13A-13B) coincides with that of moe A lacking the carbamoyl group. Expression of a functional moeM5 gene in the OB20a mutant restored moe A biosynthesis. The purification of the intermediate accumulated in OB20a in quantities sufficient for more detailed structural elucidation has been hampered by its instability and very low levels.

Nevertheless, several conclusions may be drawn from the data obtained. First, moeM5 appears to govern carbamoylation of a moe A intermediate, which is one of several tailoring reactions involved in moes bioactivity.

Second, blocked carbamoylation does not appear to abolish the formation of pentasaccharide moiety of moe A. The ability to remove a certain chemical moiety from a given position of moe in order to obtain a more valuable derivative or to modify this position chemically would be very beneficial. For example, manipulations of genes responsible for introduction of carbamoyl, methyl, and amido groups into moe molecules and those involved in lipid-phosphoglycerate assembly are of interest since these functionalities contribute to moe bioactivity. However, it was not previously known whether the disruptions of a gene governing a certain catalytic step would lead to the production of desired intermediate. For instance, the absence of a specific chemical group on a first sugar might block the attachment of the second sugar, and thus the assembly of entire lipid-pentasaccharide scaffold would be interrupted.

Based on the studies of the present invention, it is propose that the polypeptide encoded by the moeM5 gene is a carbamoyltransferase which participates in the conversion of Moe intermediate compound 5, 6 or 7 in the course of moenomycin biosynthesis to yield a Moe intermediate compound 8, 9 or 10, respectively, as shown in FIGS. 4A-4D.

Here, however, it was demonstrated that the generation of biologically active moe derivatives through genetic engineering is possible. In this instance, either carbamoyltransfer takes place after the glycoside scaffold of moe A is formed, or moe GTs possess a certain level of substrate flexibility allowing them to recognize sugars that have no specific functional groups. The result demonstrates that it is possible to switch off late steps of moe biosynthesis without disturbing the assembly of the complex pharmacophore scaffold.

5. moeR5

The putative translation product of the moeR5 gene resembles the C-terminal portion of the CapD-like NAD-dependent epimerases/dehydratases involved in capsular polysaccharide biosynthesis in various bacteria. (53% identity and 68% similarity to putative CapD protein from Nocardioides sp. JS614) (Lin 1994, Smith 1999). The nucleotide and polypeptide sequences are shown in Tables 48 and 49, respectively. A sequence alignment between moeR5 and the closest homolog identified in the BLAST search is shown in Table 50.

We have previously proposed that moeR5moeS5 encode a 4,6-dehydratase/ketoreductase pair that controls the conversion of UDP-GlcNAc into UDP-chinovosamine (Ostash 2007), the unit C sugar of moe A (FIG. 1). Consistent with this, the 38-1+ strain was shown to accumulate a moe A derivative containing GlcNAc in place of chinovosamine. A moeR5⁺moeS5⁺ 38-1⁺ strain accumulated a compound, 18, having an accurate mass and fragmentation pattern identical to a previously characterized moe A precursor (FIGS. 4A-4D and (Ostash 2007). While this result was expected, we were surprised to find that the moeR5⁺38-1⁺ strain also produces 18 (Table 21). There is a close moeS5 homolog in the S. coelicolor genome (Ostash 2007), and it is probable that a similar homolog exists in the S. lividans genome and complements the loss of moeS5 function in the moeR5⁺38-1⁺ strain. However, we cannot rule out the possibility that MoeR5 catalyzes both reactions. Co-expression of moeS5 and moeno38-1 yielded no new products (data not shown).

TABLE 48 DNA Sequence of moeR5 ATGTTTGGCGATAATTCCGTGGGGTACGACGCGAACTTTCCGGCCGGTGG ACCTCTCACCTTGGACCTCGAGAGGATTATCGGCCGCCAACGAATAAGGA CCGGTCTCGAGAGCAGCGCCGGATTACTGCGCGGCCGACGGATCCTGGTC ACCGGAGCCGGCGGCTACATCGGATCGGAACTGTGCCGGCAGCTCAGCCG GTGGGAACCCGAGAGCCTCATGATGCTCGACCGGAACGAGACGGCCCTCC ACCTGGCGGCCACCAGCATCGGGAACGTCTCCCCGTCGGTGCGGACCTCC ATCCTCCTCGCGGACATCAGGGACTCCAGAGGGCTCGCCCGGCTGTTCCA GCAGTGCCGGCCGGACACCGTCTTCCACGCGGCGGCCCTCAAATGGGTGC CCATCCTGGAGAAGTTCCCCGGGGAAGCCGTCAAGACGAATGTCTTCGGC ACCCGAGCGGTGCTCGAGGCGGCCCTGGCCGCGGACGTCGCGTTCCTGGT GAACATCTCGACCGACAAGGCGGTCGATCCGGTCGGGGTGCTCGGATACT CGAAACGCATAGCCGAAGGACTCACCGCGGCGGCCGCGATCCAGGCGGGC AGACCGTACGTGAGCGTGCGCTTCGGCAACGTGCTCGGTTGCCAGGGGTC CTTCCTCGACGTCTTCGCCCGGCAGATCGCGGCCGGCAGACCGGTGACGG TCACCCACCCCGAGGTGACGCGCTATCTGATGACCGTCCAGGAGGCCGTG GAACTGGTCATCCAGTCGGTCGCGCTGGGCAGCGTCGGCCACGCCCTGGT CCTGGACATGGGGGAACAGGTCCGGATCCTCGACATCGCCAGAAGGCTCA TCGCGCACGCCGGTGCGGAGCTCCCGGTCCGCTACGTCGGGCTGCGGCCG GGGGAGAAGCTCACCGAGGCGCTGGTGGCCCCTTCCGAGTCCCCGGTCCG GCACGGGCATCCGAAGATCATGGAAGTGCCGGTGCCGGCCCTGAAGGCGG GGGACGGCCCGGAACTCGACGCCTGGGGCGAGGACCAGGCCGTCGTCGCC GCCCTGCGCGCCACCTGCCTCGCCATGGCGGGCGACGACCCGGTGGCGCA GGACCCCGGCCACCGGCTGGTCTGA (SEQ ID NO: 17)

TABLE 49 Amino Acid Sequence of moeR5 MFGDNSVGYDANFPAGGPLTLDLERIIGRQRIRTGLESSAGLLRGRRILV TGAGGYIGSELCRQLSRWEPESLMMLDRNETALHLAATSIGNVSPSVRTS ILLADIRDSRGLARLFQQCRPDTVFHAAALKWVPILEKFPGEAVKTNVFG TRAVLEAALAADVAFLVNISTDKAVDPVGVLGYSKRIAEGLTAAAAIQAG RPYVSVRFGNVLGCQGSFLDVFARQIAAGRPVTVTHPEVTRYLMTVQEAV ELVIQSVALGSVGHALVLDMGEQVRILDIARRLIAHAGAELPVRYVGLRP GEKLTEALVAPSESPVRHGHPKIMEVPVPALKAGDGPELDAWGEDQAVVA ALRATCLAMAGDDPVAQDPGHRLV (SEQ ID NO: 40)

TABLE 50 Sequence Homology of moeR5 gi|71367042|ref|ZP_00657575.1|Polysaccharide biosynthesis protein CapD [Nocardioides sp. JS614] gi|71157263|gb|EAO07657.1|Polysaccharide biosynthesis protein CapD [Nocardioides sp. JS614] Length = 667 Score = 322 bits (824), Expect = 2e−86, Method: Composition-based stats. Identities = 185/346 (53%), Positives = 237/346 (68%), Gaps = 2/346 (0%) SEQ ID NO: 189 Query 21 LDLERIIGRQRIRTGLESSAGLLRGRRILVTGAGGYIGSELCRQLSRWEPESLMMLDRNE 80 +++  ++|| ++ | + | || | ||++||||||| ||||||||+ |++|  ||||||+| Sbjct 317 INITDVLGRNQLDTDVASIAGYLAGRKVLVTGAGGSIGSELCRQIYRYQPAELMMLDRDE 376 SEQ ID NO: 190 Query 81 TALHLAATSIGNVSPSVRTSILLADIRDSRGLARLFOQCRPDTVFHAAALKWVPILEKFP 140 +|||    ||   +      ++| |||| + +  +|   ||| |||||| +|+||++| Sbjct 377 SALHRVQLSIHGRALLDSDDVILCDIRDEKAVRTIFANRRPDVVFHAAALKHLPMLEQYP 436 Query 141 GEAVKTNVFGTRavleaalaadvaflvNISTDKAVDPVGVLGYSKRIAEGLTAAAAIQAG 200  ||||||| ||| ||+||    |    |||||||| +|  |||||||+|| +||| | +| Sbjct 437 AEAVKTNVIGTRTVLDAADLVGVDRFVNISTDKAANPSSVLGYSKRVAERITAAQAREAS 496 Query 201 RPYVSVRFGNVLGCQGSFLDVFARQIAAGRPVTVTHPEVTRYLMTVQEAVELVIQSVALG 260   |+||||||||| +|| |  |||||||| |+|||||+|+|+ ||++|| +||||+ |+| Sbjct 497 GTYLSVRFGNVLGSRGSVLAAFARQIAAGGPITVTHPDVSRFFMTIEEACQLVIQAAAIG 556                                                       SEQ ID NO: 191 Query 261 SVGHALVLDMGEQVRILDIARRLIAHAGAELPVRYVGLRPGEKLTEALVAPSES-PVRHG 319   | |||||||| |+|+|+| +||  ||  +|+ | ||| |||| | |    |   || Sbjct 557 GPGEALVLDMGEPVKIVDVAEQLIEQAGTPVPIEYTGLREGEKLHEELFGEGEPCDVRPR 616                                        SEQ ID NO: 192 Query 320 HPKIMEVPVPALKAGDGPELDAWGEDQAVVAALRATCL-AMAGDDP 364 || +  |||| +  |+   |   ||   |  ||   || ++  ||| Sbjct 617 HPLVSHVPVPPITDGEVLGLTLVGEPDDVRQALHDACLVSIEADDP 662

Based on the studies of the present invention, it is propose that the polypeptide encoded by the moeR5 gene is a hexose-4,6-dehydratase which participates in the conversion of Moe intermediate compound 4 in the course of moenomycin biosynthesis to yield a Moe intermediate compound 6 or 7, wherein chinovosamine is utilized as a donor, respectively, as shown in FIGS. 4A-4D. Furthermore, based on the studies of the present invention, it is propose that the polypeptide encoded by the moeR5 gene is a hexose-4,6-dehydratase which participates in the conversion of Moe intermediate compound 11 in the course of moenomycin biosynthesis to yield a Moe intermediate compound 14 or 15, wherein chinovosamine is utilized as a donor, respectively, as shown in FIGS. 4A-4D.

6. moeS5

Gene moeS5, located upstream of moeR5, encodes a putative protein homologous to putative polysaccharide biosynthesis protein SCO7194 from S. coelicolor A3(2) (63% identity and 76% similarity). The nucleotide and polypeptide sequences are shown in Tables 51 and 52, respectively. A sequence alignment between moeS5 and the closest homolog identified in the BLAST search is shown in Table 53. A CDD search revealed an RfbD domain (COG1091) in moeS5. This domain is present in many known NDP-4-dehydrohexose reductases. It is possible that the moeR5 and moeS5 genes may govern two consecutive steps of NDP-Glc-Nac transformation into NDP-chinovosamine (unit C of moe A; FIG. 1). Particularly, the NDP-Glc-NAc-4, 6-dehydratase activity of moeR5 may convert NDP-Glc-NAc into NDP-4-keto-6deoxy-Glc-Nac, and the hexose-4-ketoreductase activity of moeS5 may reduce this intermediate to yield NDP-chinovosamine.

TABLE 51 DNA Sequence of moeS5 GTGAGAGTTCTTGTCGTCGGCGGGAGCGGCTTCCTCGGGTAGGAGGTGCT CCGCCGGGCCGTGGCCGCCGGGTGGGACGTGGCCGCGACCTACCGGACCC GCCCCGAGGAACTGCCGCCGGTCACCTGGTACCGGGCCGACCTCCGTGAC CCGGGGCGGATGGGAGAGGTGCTGGCCCGGACCCGGCCGGCCGCGGTGAT CAACGCGTCGAGCGGACACGCCGACTGGGCGGTCACGGCCGACGGCGCGG CCCGCCTCGCCCTGGAGGCGGCGCGCGCCGGCTGCCGACTAGTCCACGTC TCCTCCGACGCCGTGTTCTCCGGAGCCGACGTCCACTACCCGGAGGAGGC CCTCCCCGACCCCGTCTCCCCGTACGGCGCGGCCAAGGCCGCGGCGGAGA CGGCCGTCAGGGTGGCCGTGCCCGAGGCCGCCGTGGTGCGCACCTCGCTC ATCGTGGGGCACAACCGGTCCGCCCACGAGGAGGCGGTGCACGCCCTGGC GGCCGGCCGGCGCGCCGGCGTCCTGTTCACGGACGACGTCCGCTGTCCGG TCCACGTCGACGATCTGGCCTCCGCGCTTTTGGAGATCGCGGCGTCGGAC GGGTCCGGGGTGTTCCACGTGGCGGGACCGGACGCGATGAACCGTCACGA CCTGGGTGTCCTCATAGCCCGGCGGGACGGACTGGACCCGGCCCGGCTGC CGGCCGGTCTGCGGAGCGAGGTGGCCCCGCCGGGGAACCTCGACATCCGT CTCGTCACCGATGCCACGCGGGCCCGGCTCCGGACCCGGTTGCGGGGCGC GCGCGAATTCCTCGGCCCCGGCGTTCCGGTGACGCGGGGCGTCCGTTGA (SEQ ID NO: 18)

TABLE 52 Amino Acid Sequence of moeS5 VRVLVVGGSGFLGYEVLRRAVAAGWDVAATYRTRPEELPPVTWRADLRDP GRMGEVLARTRPAAVINASSGHADWAVTADGAARLALEAARAGCRLVHVS SDAVFSGADVHYPEEALPDPVSPYGAAKAAAETAVRVAVPEAAVVRTSLI VGHNRSAHEEAVHALAAGRRAGVLFTDDVRCPVHVDDLASALLEIAASDG SGVFHVAGPDAMNRHDLGVLIARRDGLDPARLPAGLRSEVAPPGNLDIRL VTDATRARLRTRLRGAREFLGPGVPVTRGVR (SEQ ID NO: 41)

TABLE 53 Sequence Homology of moeS5 emb|CAC01594.1|putative polysaccharide biosynthesis protein [Streptomyces coelicolor A3(2)] SCO7194 Length = 271 Score = 335 bits (860), Expect = 1e−90 Identities = 171/269 (63%), Positives = 205/269 (76%), Gaps = 0/269 (0%) SEQ ID NO: 193 Query 3 VLVVGGSGFLGYEVLRRAVAAGWDVAATYRTRPEELPPVTWYRADLRDPGRMXEVLARTR 62 ||||||||||| |++|+| |||  ||||+ ||| + |  ||+  ||||  |+ ||+| Sbjct 3 VLVVGGSGFLGTELVRQASAAGHRVAATFATRPCDGPEATWHEVDLRDGARVEEVVASLA 62 SEQ ID NO: 194 Query 63 PAAVINASSGHADWAVTADGAARLALEAARAGCRLVHVSSDAVFSGADVHYPEEALPDPV 122 |  ||||||| |||||||+|+ |||+ | +  ||||||||||||||+ ||| |  ||||| Sbjct 63 PCVVINASSGSADWAVTAEGSVRLAMTAVKYDCRLVHVSSDAVFSGSRVHYDESCLPDPV 122 Query 123 SPYGAAKAAAETAVRVAVPEAAVVRTSLIVGHNRSAHEEAVHALAAGRRAGVLFTDDVRC 182 ||||+|||+||||+| +   || |+|| ||++|| ||||||+||||| +||| |||+  + Sbjct 123 TPYGAAKAAAETGIRLLAPAAVIARTSLIIGGIQSEHVRLVHDLATGSRTGALFTDDVRC 182 Query 183 PVHVDDLASALLEIAASDGSGVFHVAGPDAMNRHDLGVLIARRDGLDPARLPAGLRSEVA 242 ||||+|||+||||+| +   || |+|| ||+++|| ||||||+||||| +||| |||+  + Sbjct 183 PVHVEDLAAALLELAFTGACGVHHLAGKDAVSRHGLGVLIAQRDGLDASRLPEGLRAGTS 242 Query 243 PPGNLDIRLVTDATRARLRTRLRGAREFL 271   | |+|| + ||||+||||+||  ||| Sbjct 243 LSGALDVRLDSRATRAKLRTRVRGVHEFL 271

Based on the studies of the present invention, it is propose that the polypeptide encoded by the moeS5 gene is a hexose-4-ketoreductase which participates in the conversion of Moe intermediate compound 4 in the course of moenomycin biosynthesis to yield a Moe intermediate compound 6 or 7, wherein chinovosamine is utilized as a donor, respectively, as shown in FIGS. 4A-4D. Further, based on the studies of the present invention, it is propose that the polypeptide encoded by the moeS5 gene is a hexose-4-ketoreductase which participates in the conversion of Moe intermediate compound 11 in the course of moenomycin biosynthesis to yield a Moe intermediate compound 14 or 15, wherein chinovosamine is utilized as a donor, respectively, as shown in FIGS. 4A-4D.

7. moeE5

The putative protein encoded by the moeE5 gene appears the most similar to the putative NDP-hexose 4-epimerase from Symbiobacterium thermophilum IAM14863 (46% identity and 58% similarity) and other known epimerases. The nucleotide and polypeptide sequences are shown in Tables 54 and 55, respectively. A sequence alignment between moeE5 and the closest homolog identified in the BLAST search is shown in Table 56. S. ghanaensis produces moes A₁₂ and C₁ as minor components of moe complex in which unit F (moenuronamide) has D-galacto configuration (and not D-gluco as in Moe A) (Welzel 2005). Such a rearrangement requires epimerization of hydroxyl group in 4^(th) position of hexose ring and moeE5 protein appears to fit this role.

TABLE 54 DNA Sequence of moeE5. GTGTCGAGCGATACACACGGAACGGACTTAGCGGACGGCGACGTTTTGGT CACCGGTGCGGCCGGCTTCATCGGGTCGCACCTGGTGACGGAACTGAGGA ATTCCGGCAGAAACGTTGTGGCGGTGGACCGGAGACCCCTTCCGGACGAC TTGGAGAGTAGGTCCCCGCCCTTTACCGGTTCGCTCCGGGAGATACGCGG TGAGCTCAACTCATTGAATCTGGTGGAGTGCCTGAAAAACATCTCGACGG TCTTCCACTTGGCCGCGTTACCCGGAGTCCGCCCGTCCTGGACCCAATTC CCCGAGTACCTCCGGTGCAATGTACTGGCGACCCAGCGCCTGATGGAGGC CTGTGTGCAGGCCGGCGTGGAACGCGTGGTGGTCGCCTCGTCCTCCAGCG TCTACGGCGGCGCGGACGGCGTGATGAGCGAGGACGACCTGCCCCGTCCG CTCTCCCCCTACGGGGTCACCAAACTCGCCGCGGAGCGGCTGGCCCTGGC CTTCGCGGCCCGCGGCGACGCCGAGCTCTCGGTCGGCGCCCTGAGGTTCT TCACCGTCTAGGGCCCCGGCCAGCGCCCGGACATGTTCATCTCCCGGCTG ATCCGGGCGACGCTCCGGGGCGAACCCGTCGAGATCTACGGCGACGGGAC CCAGCTCCGCGACTTCACCCATGTGTCCGACGTGGTGCGGGCGCTGATGC TGACCGCGTCGGTGCGGGACCGGGGCAGCGCGGTGCTGAACATCGGCACC GGGAGCGCCGTCTCGGTCAACGAAGTGGTCTCCATGACCGCGGAGCTGAC CGGTCTGCGCCCGTGCACCGCGTACGGTTCCGCCCGCATCGGCGACGTCC GCTCGACCACCGCCGACGTGCGGCAGGCCCAGAGCGTCCTGGGCTTCACG GCCCGGACGGGTCTGCGGGAAGGTCTCGCCACCCAGATCGAGTGGACCCG GCGGTCACTGTCCGGCGCCGAGCAGGACACCGTCCCGGTCGGCGGCTCCT CGGTGTCCGTGCCGCGGCTGTAG (SEQ ID NO: 19)

TABLE 55 Amino Acid Sequence of moeE5 VSSDTHGTDLADGDVLVTGAAGFIGSHLVTELRNSGRNVVAVDRRPLPDD LESTSPPFTGSLREIRGDLNSLNLVDCLKNISTVFHLAALPGVRPSWTQF PEYLRCNVLATQRLMEACVQAGVERVVVASSSSVYGGADGVMSEDDLPRP LSPYGVTKLAAERLALAFAARGDAELSVGALRFFTVYGPGQRPDMFISRL IRATLRGEPVEIYGDGTQLRDFTHVSDVVRALMLTASVRDRGSAVLNIGT GSAVSVNEVVSMTAELTGLRPCTAYGSARIGDVRSTTADVRQAQSVLGFT ARTGLREGLATQIEWTRRSLSGAEQDTVPVGGSSVSVPRL (SEQ ID NO: 42)

TABLE 56 Sequence Homology of moeE5 ref|YP_074610.1|UDP-glucose 4-epimerase [Symbiobacterium thermophilum IAM 14863] Length = 292 Score = 230 bits (587), Expect = 6e−59 Identities = 138/300 (46%), Positives = 175/300 (58%), Gaps = 18/300 (6%) SEQ ID NO: 195 Query 16 LVTGAAGFIGSHLVTELRNSGRNVVAVDRRPLPDDLESTSPPFTGSLREIRGDLNSLNLV 75 ||||||||||||||  || +| +|| |||||  |              + ||| +|+| Sbjct 5 LVTGAAGFIGSHLVEALRAAGHDVVGVDRRPGAD---------------VVGDLLTLDLA 49 SEQ ID NO: 196                                   SEQ ID NO: 197 Query 76 DCLKNISTVFHLAALPGVRPSWTQFPEYLRCNVLATQRLMEACVQAGVERVVVASSSSVY 135   |  +  | |||  |||| ||+||| ||  |+  ||||+|+     +++ |+||+|||| Sbjct 50 PLLDGVEYVVHLAGQPGVRESWSQFPAYLAGNLQTTQRLLESLRDRPLKKFVLASTSSVY 109 Query 136 GGADGVMSEDDLPRPLSPYGVTKLAAERLALAFAARGDAELSVGALRFFTVYGPGQRPDM 195 |       ||    |+||||+||||||+|   +     | +   |||+|||||| ||||| Sbjct 110 GEVPMPAREDGPAMPVSPYGLTKLAAEKLCDLYGR--TAGIPWVALRYFTVYGPRQRPDM 167                                      SEQ ID NO: 198 Query 196 FISRLIRATLRGEPVEIYGDGTQLRDFTHVSDVVRALMLTASVRDRGSAVLNIGTGSAVS 255   ||   | | |||++|||||+||||||+|+| | |    |++       +|+| ||||+ Sbjct 168 AFSRWFNAALDGEPIQIYGDGSQLRDFTYVADAVTATQ-RAALNPVVGVPINVGGGSAVT 226                                        SEQ ID NO: 199 Query 256 VNEVVSMTAELTGLRPCTAYGSARIGDVRSTTADVRQAQSVLGFTARTGLREGLATQIEW 315 | | + + | +||            ||+| | ||  +    +||   | | |||  |  | Sbjct 227 VREAIRLIAAITGRPIRIRQLPPAPGDMRETRADTERLWREVGFRPSTPLEEGLWQQYRW 286

Based on the studies of the present invention, it is propose that the polypeptide encoded by the moeE5 gene is a NDP-hexose 4-epimerase which participates in the conversion of Moe intermediate compound 1 in the course of moenomycin biosynthesis to yield a Moe intermediate compound 2 or 3, as shown in FIGS. 4A-4D.

D. Genes for Phosphoglycerate-Lipid Moiety Biosynthesis

Two genes in cluster 1 were identified that fit this functional profile: moeN5 and moeO5. The phosphoglycerate-moenocinol chain of moenomycin is unusual in containing a cis-allylic ether linkage and an irregular isoprenoid chain of uncertain biosynthetic provenance. Two putative prenyltransferases, moeO5 and moeN5, were identified via in silico analysis of the moe genes and proposed to participate in the production of the phosphoglycerate-lipid moiety (Ostash 2007).

1. moeN5

The product of moeN5 gene translation shows local homology to putative geranylgeranyl pyrophosphate synthase from Chlamidia trachomatis (30% identity and 58% similarity over 56 amino acid fragment). The nucleotide and polypeptide sequences are shown in Tables 57 and 58, respectively. A sequence alignment between moeN5 and the closest homolog identified in the BLAST search is shown in Table 59.

TABLE 57 DNA Sequence of moeN5. ATGCTCGCCGCCGAGGCCGCCAACCGCGACCATGTCACGCGGTGCGTCGC GCAGACCGGTGGGTCGCCGGACCTGGTGGCGCACACCGCCGCCCTGCGCC TGTACCTGAGGGTGCCCCACTTCCTCACCGAGTGGACGACCGACCCGGAC CGGCGGGCCGCGGTGTCCCGCGCGCTGGCCCTCGACATCGTCTCCATGAA GCTCCTCGACGACCTGATGGACGACGACACCGGACTCGACCGGGTCGAAC TCGCCTGTGTCTGCCTCCGCCTCCACCTGCGGGCGCTGCACGAACTGGAA TCCCTCGCCCGGGACCCCAAGGCGGTGACGGACATCCTGGAGCAGGACGC CGTCCACCTCTGCGGCGGCCAGATACGCACCAAACGCTCTCGGGCGACGA ACCTCCGGGAGTGGCGCGCCCATGCGAGCACCTACGGCTCCACCTTCCTG GGCCGCTACGGGGCACTCGCGGCCGCCTGCGGGGGGGAAGGCCAACCGGC GGACTCCGTAAGGGAGTTCGCAGAGGCTTTCGCCATGACCATCACCATGG CGGACGACCTGACCGACTACGACCGCAACGGCGAGCGGGACGGCAACCTC GCCCATCTGATGCGGACCGGGGCCGTGGCCGGCCAGGACGTCGTGGACCT GCTGGAGGAGCTGCGCGGGCGGGCCCTCGCCGCGGTGGCGGCACCGCCCG GCGCGCCCGGTCTGGTGCCGGTCGTCCACCTCTACACGGACGACGTGCTG GTACGGCTGCTTCCCCGGCACCTGGGGGAGTGA (SEQ ID NO: 20)

TABLE 58 Amino Acid Sequence of moeN5 MLAAEAANRDHVTRCVAQTGGSPDLVAHTAALRLYLRVPHFLTEWTTDPD RRAAVSRALALDIVSMKLLDDLMDDDTGLDRVELACVCLRLHLRSLHELE SLARDPKAVTDILEQDAVHLCGGQIRTKRSRATNLREWRAHASTYGSTFL GRYGALAAACGGEGQPADSVREFAEAFAMTITMADDLTDYDRNGERDGNL AHLMRTGAVAGQDVVDLLEELRGRALAAVAAPPGAPGLVPVVHLYTDDVL VRLLPRHLGE (SEQ ID NO: 43)

TABLE 59 Sequence Homology of moeN5 gb|EAQ07619.1|Geranylgeranyl pyrophosphate synthetase [Loktanella vestfoldensis SKA53] Length = 293 Score = 40.0 bits (92), Expect = 0.097 Identities = 35/119 (29%), Positives = 52/119 (43%), Gaps = 8/119 (6%) SEQ ID NO: 200             SEQ ID NO: 201 Query 141 HASTYGSTFLGRYGALAAACGGEGQP-ADSVREFAEAFAMTITMADDLTDYDRNGERDGN 199 | +  |+ |+      | | | | +| |+      ||| +   + | | |    |+  | Sbjct 167 HQAKTGALFIAATQMGAVAAGQEAEPWAELGARIGEAFQVADDLRDALCDDATLGKPAGQ 226 SEQ ID NO: 202                SEQ ID NO: 203 Query 200 LAHLMRTGAVAG---QDVVDLLEELRGRALAAVAAPPGAPGLVPVVHLYTDDVLVRLLP 255      |  |||    |  |   +++ | |++++ | ||   |  +|  | |    ||+| Sbjct 227 DDLHGRPNAVAAYGVQGAVKRFDDILGGAISSIPACPGEAALAQMVRAYAD----RLVP 281                                                      SEQ ID NO: 204

The low similarity of moeN5 to other known prenyltransferases can be explained by the intrinsically low sequence homology among different prenyltransferases and the uniqueness of the reactions catalyzed by moeN5 (e.g. the linkage of geranyl and farnesyl pyrophosphates to give C₂₅ isoprene chain). No other genes were identified within the moe clusters which would govern the unusual rearrangement of the central part of moenocinol (Schuricht 2001, Neundorf 2003). Therefore, moeN5 is likely to control both prenyltransfer and C₂₅ chain rearrangement, or the latter step may be controlled by a gene outside the moe cluster. Additionally or alternatively, the rearrangement may occur spontaneously after the formation of the C₂₅ chain.

We disrupted moeN5 in a heterologous S. lividans TK24 host that was previously shown to produce moe A derivatives following integration of the appropriate genes in the chromosome. HPLC-MS analysis showed that the ΔmoeN5 strain accumulated two compounds, 22/23, having similar mass-spectral characteristics (Table 21). As is evident from LC-MS and MS² analyses of ΔmoeN5 extracts (SD), compounds 22/23 differ from compound 19, which is produced by the parental 38-1⁺ strain (FIG. 4), primarily in the structure of the polyprenol chain. Whereas moe A and all reported derivatives, including 19, contain an irregular C25 isoprenoid chain, compounds 22/23 have a C15 cis-farnesyl chain. 22 and 23 differ from one another in the structure of unit B (moe A numbering; see FIG. 1), which contains either a carboxyl group (22) or a carboxamide moeity (23). Heterogeneity at this position with regard to the unit B structure has already been reported for other moenomycins (Zehl 2006). The aforementioned results show that moeN5 encodes a prenyltransferase involved in the coupling of a C10 isoprene unit to either the C15 chain of 22/23 or its precursor(s). Based on results presented below, we suggest that the farnesylated trisaccharide precursors 8 and 9/10 are the first possible substrates for the MoeN5-catalyzed reaction (FIGS. 4A-4D). That is, based on the studies of the present invention, it is propose that the polypeptide encoded by the moeN5 gene is a prenyltransferase which participates in the conversion of Moe intermediate compound 8, 9, or 10 in the course of moenomycin biosynthesis to yield a Moe intermediate compound 11, 12 or 13, respectively, as shown in FIGS. 4A-4D. The farnesylated trisaccharides precursors described above antibiotic biological activity.

2. moeO5

The presence of a phosphoglycerate moiety in moe-like antibiotics is without precedent in secondary metabolism, and mechanisms of phosphorus incorporation into moe A have puzzled researchers for years. Gene moeO5, located upstream of the prenyltransferase gene moeN5, was identified as having a translation product with homology to geranylgeranylglyceryl diphosphate synthases (GGGPSs) from various Archaea (27% identity and 43% similarity to GGGPS from Thermoplasma acidophylum) (Nemoto 2003). The nucleotide and polypeptide sequences are shown in Tables 60 and 61, respectively. A sequence alignment between moeO5 and the closest homolog identified in the BLAST search is shown in Table 62.

TABLE 60 DNA Sequence of moeO5. GTGAACGCCTCACCGCAACTGGACCACCACACGGAACTCCACGCCGCACC ACCGCTCTGGCGGCCGGGACGCGTGCTCGCCCGGCTGCGCGAGCACCAAC CGGGCCCCGTCCACATCATCGACCCCTTCAAGGTCCCGGTGACGGAAGCG GTCGAGAAGGCGGCGGAGCTCACGCGGCTGGGCTTCGCCGCCGTCCTTCT GGCCAGCACCGACTACGAGTCGTTCGAGTCGCACATGGAGCCGTACGTGG CGGCGGTGAAGGCGGCCACCCCGTTACCGGTCGTCCTGCACTTCCCGCCC CGCCCGGGGGCCGGCTTCCCGGTGGTCCGCGGCGCGGACGCGCTCCTGCT GCCCGCGCTGCTGGGCTCGGGCGACGACTACTTCGTCTGGAAGAGCTTCC TCGAGACGCTGGCCGCCTTCCCCGGCCGAATACCCCGCGAGGAGTGGCCC GAGCTGCTCCTCACCGTCGCCCTCACCTTCGGCGAGGACCCCCGCACCGG GGACCTGCTCGGCACCGTGCCGGTGAGCACGGCCTCCACCGAGGAGATCG ACCGGTACCTCCACGTCGCCCGTGCCTTCGGTTTCCACATGGTGTACCTG TACTCGCGCAACGAGCACGTGCCGCCCGAGGTCGTACGCCACTTCCGCAA GGGGCTCGGCCCCGACCAGGTGCTCTTCGTGAGCGGCAACGTCCGCTCCG GGCGGCAGGTCACCGAGTACCTCGACAGCGGGGCGGACTACGTGGGGTTC GCCGGAGCCCTGGAACAGCCGGACTGGCGGTCCGCCCTCGCCGAGATCGC CGGGAGGCGGCCCGCCGCCCCGGCCCGTCCGGGGAGCGGGCGGTGA (SEQ ID NO: 21)

TABLE 61 Amino Acid Sequence of moeO5 VNASPQLDHHTELHAAPPLWRPGRVLARLREHQPGPVHIIDPFKVPVTEA VEKAAELTRLGFAAVLLASTDYESFESHMEPYVAAVKAATPLPVVLHFPP RPGAGFPVVRGADALLLPALLGSGDDYFVWKSFLETLAAFPGRIPREEWP ELLLTVALTFGEDPRTGDLLGTVPVSTASTEEIDRYLHVARAFGFHMVYL YSRNEHVPPEVVRHFRKGLGPDQVLFVSGNVRSGRQVTEYLDSGADYVGF AGALEQPDWRSALAEIAGRRPAAPARPGSGR (SEQ ID NO: 44)

TABLE 62 Sequence Homology of moeO5 gi|110553682|gb|EAT66825.1|geranylgeranylglyceryl phosphate synthase [Thermofilum pendens Hrk 5] Length = 255 Score = 50.8 bits (120), Expect = 6e−05, Method: Composition-based stats. Identities = 68/236 (28%), Positives = 110/236 (46%), Gaps = 14/236 (5%) SEQ ID NO: 205 Query 25 VLARLREHQPGPVHIIDPFKVPVTEAVEKAAELTRLGFAAVLLASTDYESFESHMEPYVA 84 +| ++|||    + +||| |     |   | |+   | +|+++  +   | |+  +  | Sbjct 9 ILEKIREHGAIHMTLIDPEKTTPEVAARIAREVAEAGTSAIMVGGSIGVS-EAMTDEVVL 67 SEQ ID NO: 206                                     SEQ ID NO:207 Query 85 AVKAATPLPVVLHFPPRPGAGFPVVRgadalllpallgsgddYFVWKSFLETLAAFPGRI 144 |+| +| +||+| ||  | |   + | |||+   ++| | + ||+  + ++        | Sbjct 68 AIKRSTEVPVIL-FPGSPTA---LSRHADAVWFLSVLNSQNPYFITGAQMQG-----API 118          SEQ ID NO:208 SEQ ID NO:209 Query 145 PREEWPELLLTVALTFGEDPRTGDLLGTVPVSTASTEEIDRYLHVARAFGFHMVYLY--S 202  +    |+|    +  ||      +  | |+  |  | +  |   |   ||  |||   | Sbjct 119 VKRYGLEVLPLGYIIVGEGGAVSIVSYTRPLPFAKPEVVAAYALAAEYMGFQFVYLEGGS 178 SEQ ID NO:210 SEQ ID NO:211 Query 203 RNEHVPPEWRHFRKGLGPDQVLFVSGNVRSGRQVTEYLDSGADYVGFAGALEQPD 258   | |||++|+   ||+     | | | +||     |   +||| +     +|+ + Sbjct 179 GGEPVPPKIVKMV-KGV-TTLPLIVGGGIRSPEVAKELAKAGADIIVTGTIVEESE 232                   SEQ ID NO:212

These enzymes couple either C₂₀ or Cu₂ isoprene chains to sn-glycerol-1-phosphate via an ether link, thus yielding the first intermediate to archaeal membrane lipids (Nemoto 2003, Soderberg 2001, Tachibana 2000). As well as GGGPSs, moeO5 also contains a so called phosphate-binding enzymes domain (COG1646) which is related to the PcrB-FMN domain. PcrB-like protein encoding genes are also present in bacterial genomes; however, their functions remain unknown. Sequence similarity between moeO5 and GGGPSs, and the structural resemblance between the product of GGGPS activity and moenocinol-phosphoglycerate suggests that phosphoglycerate is incorporated into moe A via a moeO5-assisted transfer of either moenocinol pyrophosphate or its precursor to phosphoglycerate. The possibility cannot be excluded, at this point, that phosphoglycerate (unit G) is attached to the sugar (unit F precursor) followed by a moeO5 transfer of the isoprene chain to the F-G intermediate. Additional biochemical characterizations of moeO5 are currently underway to determine substrate preferences. The sequence analysis of moeO5 (Ostash 2007), combined with the isolation of farnesylated monosaccharide intermediates (see below), suggests that the prenylsynthase MoeO5 couples phosphoglycerate to farnesyl pyrophosphate to yield the first dedicated moe A precursor 1P (FIGS. 4A-4D). This precursor is proposed to serve as the starting point for stepwise addition of the sugars.

E. Transport Genes

Four genes which meet functional criteria as components of ATP-binding cassette (ABC) transport systems have been located in moe cluster 1: moeP5, moeX5, moeD5 and moeJ5.

1. moeP5

Gene moeP5 encodes a putative ATP-binding protein (43% identity and 60% similarity) with no transmembrane domains. MoeP5 is related to DrrA-like family of ATP-ases (Kaur 1997) involved in drug resistance and lipid transport. The nucleotide and polypeptide sequences are shown in Tables 63 and 64 respectively. A sequence alignment between moeP5 and the closest homolog identified in the BLAST search is shown in Table 65.

TABLE 63 DNA Sequence of moeP5. ATGGGCCATTCCGTCGGTGCCCGAGAGGGGTACCACGGCATGTCCGAGCC CGCCGACCGCAAGATTCTCCTGCAGGCGCGCGGCGTCGTGAAACGCTACA AGCGCCGCCGCGTCCTGACCGGGGTCGATCTTGTCGTGCACGCGGGCGAG GTCGCCGCGATCGTCGGCAGCAACGGGACGGGCAAGTCCACCCTGCTCAA GATCTGCGCCGGTCTGCTCTCCCCCGACAAAGGACGGGTCACCGTCTCCG GCCACCTCGGCTACTGCCCGCAGAACGCGGGGGTCATGGGCTTCCTGACC CCCCGGGAGCACTTCACCCTCTTCGGCACCGGCCGGGGCCTGAGCCGCCG GGAGTCCGACCGCCGCGGCCGGAGACTCGCGGGAGAGCTCGACTGGGCCC CCGCGGAGGGCGTCCTTGCCAAGGACCTGTCGGGAGGAACCCGCCAGAAG CTGAACGTCGTCCTGTCGGCCCTGGGAGACCCGGACCTGCTGCTGCTCGA CGAGCCCTACCAGGGCTTCGACCACGGCTCCTACGTGGACTTCTGGCAGA GCGTCTGGGAGTGGCGCGAGGCGGGCAAGGCCGTCGTCGTGGTGACGCAC ATGCTCAACCAGCTCGACCGGGTGGACCAGGTGCTGGACCTCACCCCCGG CAAAGGAAGGGGCAACCGATGA (SEQ ID NO: 22)

TABLE 64 Amino Acid Sequence of moeP5 MGHSVGAREGYHGMSEPADRKILLQARGVVKRYKRRRVLTGVDLVVHAGE VAAIVGSNGTGKSTLLKICAGLLSPDKGRVTVSGHLGYCPQNAGVMGFLT PREHFTLFGTGRGLSRRESDRRGRRLAGELDWAPAEGVLAKDLSGGTRQK LNVVLSALGDPDLLLLDEPYQGFDHGSYVDFWQSVWEWREAGKAVVVVTH MLNQLDRVDQVLDLTPGKGRGNR (SEQ ID NO: 45)

TABLE 65 Sequence Homology of moeP5 gi|89319945|gb|EAS11435.1|ABC transporter related [Mycobacterium flavescens PYR-GCK] Length = 243 Score = 120 bits (301), Expect = 5e−26 Identities = 79/180 (43%), Positives = 105/180 (58%), Gaps = 0/180 (0%) Frame = +1 SEQ ID NO: 213 Query 115 LTGVDLVVHAGEVAAIVGSNGTGKSTLLKICAGLLSPDKGRVTVSGHLGYCPQNAGVMGF 294 | |||| +  |||  +|| ||+||||++||  | |+|| | |  || ||||||   | Sbjct 46 LRGVDLTLQPGEVVGLVGENGSGKSTIMKILVGELAPDAGTVVRSGVLGYCPQQPVVYER 105 SEQ ID NO: 214 Query 295 LTPREHFTLFGTgrglsrresdrrgrrlagELDWAPAEGVLAKDLSGGTRQKLNVVlsal 474 ||  ||  ||     ++     |  | |   | +    |  |  |||||  |||+ |+ | Sbjct 106 LTCDEHIELFARAYRMTHEGERRARRDLYEALGFERYAGTRADRLSGGTLAKLNLTLAML 165 Query 475 gdpdlllldEPYQGFDHGSYVDFWQSVWEWREAGKAVVVVTHMLNQLDRVDQVLDLTPGK 654  || +||||||| |||  +|+ ||  |   |+ |++|++++| +    | |+++ |  |+ Sbjct 166 ADPQVLLLDEPYAGFDWDTYLKFWDLVARRRDDGRSVLIISHFVADEHRFDRIVKLCDGR 225

Based on the studies of the present invention, it is propose that the polypeptide encoded by the moeP5 gene is a ABC transporter ATP-binding protein (Table 4).

2. moeX5

Downstream of the moeP5 gene is the moeX5 gene. The putative translation product shows homology to the N-terminal half of predicted bacterial membrane proteins (26% identity and 40% similarity). The nucleotide and polypeptide sequences are shown in Tables 66 and 67, respectively. A sequence alignment between moeX5 and the closest homolog identified in the BLAST search is shown in Table 68. Using topology prediction program TMHHM, 6 transmembrane helices were identified in the moeX5 protein. It is assumed that the moeP5 and moeX5 proteins are two elements of a transport system in which moeX5S is transporter and moeP5 energizes the transport via ATP hydrolysis.

TABLE 66 DNA Sequence of moeX5. ATGACGGCCACCCTGCGGATGGCGGAGATGACCTTCCGCGAACTGCTGCG CCGGCGGGGCGTGCTGGGCCTGCTGCTCCTGGTCCCGCTCGTCTTCTACC TCGGGCGTTACGACCAGACCGGCCAGGCGGTCCGGTTCGCCAGCCTCGGG GTGGGCTTCGCGGTCAGCGCCGCGGCCCTCTTCTCCGCGGTCGGCGGCCG GGAGATCGAACCGCTCCTGGCCCTCTCCGGGTTCCGCCCGCTCCAGCTCT TCCTGGGCCGCCTGCTGGCCCTCCTCACCGCCGGCATGGGCGTGTCCGCC CTCTACGCCGTGATCATCCTGGTCGGGCAGGACGTGGCGCACCCGCGGGC CGTCGCGGTGGAACTGGCGCTGAGCACACTGGTGGCGGTGCCGCTGGGAC TGCTGCTCGGGGCGGCCGTGCCACGGGACATGGAGGGCGCCCTGCTGCTG ATCTCCGTCATCGGCGCCCAGATGGTGATGGATCCGGCCAAGGATTCGGC CAAGGTGCTTCCCTTCTGGTCGACCCGGGAGATCATCACCTACGCGGTCG ACGGCGCGGACAGCGGGTCGTTCGACTCCGGGGTGGCCCACGCCGTCGGA GTGACGCTGCTGCTGGTCGCGGTGAGCGGTTGCGTGACGGCGGGCCGATT GCGCCGCCGGCGCCATCTGCAATTCGCGTGA (SEQ ID NO: 23)

TABLE 67 Amino Acid Sequence of moeX5 MTATLRMAEMTFRELLRRRGVLGLLLLVPLVFYLGRYDQTGQAVRFASLG VGFAVSAAALFSAVGGREIEPLLALSGFRPLQLFLGRLLALLTAGMGVSA LYAVIILVGODVAHPRAVAVELALTTLVAVPLGLLLGAAVPRDMEGALLL ISVIGAQMVMDPAKDSAKVLPFWSTREIITYAVDGADSGSFDSGVAHAVG VTLLLVAVSGCVTAGRLRRRRHLQFA (SEQ ID NO: 46)

TABLE 68 Sequence Homology of moeX5 gb|EAS99725.1|putative ABC transporter membrane protein [Mycobacterium, sp. KMS] Length = 500 Score = 55.5 bits (132), Expect = 2e−06 Identities = 63/237 (26%), Positives = 96/237 (40%), Gaps = 20/237 (8%) SEQ ID NO: 215                              SEQ ID NO: 216 Query 4 TLRMAEMTFRELLRRRGVLGLLLLVPLVFYLGR-----------YDQTGQAVRFASLG-- 50 || +      + +|    | +|+|||||| |               +|| | + |+ | Sbjct 12 TLLLTRSFITDYVRNPVNLIMLILVPLVFVLVAAGSIADAMELLQGRTGAATQTATSGWA 71 SEQ ID NO: 217                                        SEQ ID NO: 218 Query 51 VGFAVSAAALFSAVGGREIEPLLALSGFRPLQLFLGRL-LALLTAGMGVSALYAVIILVG 109  ||    |  |     |  +  | |+|    +|   |    || ||+ |||+    + Sbjct 72 AGFLSGLAMFQIRSARRADKRLQLAGLPAARLLAARAGTGLLMAGL-VSAVALAALAAR 130                                                SEQ ID NO: 219 Query 110 QDVAHPRAVAVELALTTLVAVPLGLLLGAAVPRDMEGALLLISVIGAQMVMDPAKDSAK- 168   + +|  | |   +  |+ + +| |+|| +   + ||++++ +    + + || Sbjct 131 TGIDNPARVIVGTLMFALIYLAIGALVGAVTADPVNGAVIILLIWMIDVFVGPAGSGGDY 190 SEQ ID NO: 220             SEQ ID NO: 221 Query 169 VLPFWSTREIITYAVDGADSGSF----DSGVAHAVGVTLLLVAVSGCVTAGRLRRRR 221 |   |     +|  + |  |       | |||    |  | || +      |  ||| Sbjct 191 VATRWFPTHFVTLWMVGTPSHHAGRLGDLGVASVWMVGALAVAGTVVSAGSRTGRRR 247

Based on the studies of the present invention, it is propose that the polypeptide encoded by the moeX5 gene is a ABC transporter membrane protein (Table 4).

3. moeD5 and moeJ5

Two other genes, moeD5 and moeJ5, encode proteins that both contain a nucleotide binding domain in the C-terminal region, and transmembrane segments (TMS) in the N-terminal half (5 TMS in moeJ5 and 6 TMS in moeD5 according to TMHHM program).

MoeD5 and moeJ5 are 40.9% similar to each other at the amino acid level, and display homology to putative and known ABC transporters/ABC transporter ATP binding proteins (41% identity and 55% similarity). The nucleotide and polypeptide sequences of moeD5 are shown in Tables 69 and 70, respectively. A sequence alignment between moeD5 and the closest homolog identified in the BLAST search is shown in Table 71. The nucleotide and polypeptide sequences of moeJ5 are shown in Tables 72 and 73, respectively. A sequence alignment between moeJ5 and the closest homolog identified in the BLAST search is shown in Table 74. The conserved domain search showed that moeD5 and moeJ5 are most similar to transporters from DPL and MRP families involved in drug, polypeptide, lipid and anionic substances transport (Chang 2003). In its domain architecture, the moeD5 translation product is similar to the well studied E. coli ABC transporter MsbA which also contains only one 6 TMS transmembrane domain and 1 nucleotide binding domain (McKeegan 2003). It is possible that a moeD5/moeJ5 heterodimer functions as an ATP-dependent pump. Additionally or alternatively, two ABC transporter systems may be involved in moe A efflux and/or its intramycelial transport. Multiple transport mechanisms are common for many organisms producing different antibiotics (Wilson 1999, Mendez 2001), in particular peptidoglycan biosynthesis inhibitors (Sosio 2003).

TABLE 69 DNA Sequence of moeD5.                                          GTGCTGCGC GGCTCGGCCCGCACCTACTGGACCCTCACCGGTCTGTGGGTCCTGCTGCG GGCGGGAACCCTGGTGGTGGGCCTGCTGTTCCAGCGGCTGTTCGACGCGC TGGGCGCGGGCGGGGGCGTGTGGCTGATCATCGCGTTGGTGGCCGCGATC GAGGCGGGACGGCTGTTCCTCCAGTTCGGCGTGATGATCAACAGGCTGGA GCCGCGGGTCCAGTACGGCACCACGGCGCGGCTGCGGCACGCCCTGCTGG GATCGGCCCTGCGCGGGTCGGAGGTGACGGCCCGCACCAGCCCCGGCGAG TCCCTGCGAACGGTGGGCGAGGACGTCGACGAGACGGGGTTCTTCGTCGC CTGGGCGCCGACGAACCTCGCCCACTGGCTGTTCGTCGCCGCGTCGGTCA CGGTGATGATGCGGATCGACGCCGTGGTCACCGGCGCCCTCCTCGCCCTC CTCGTCCTGCTGACGCTGGTCACCGCGCTGGCCCACAGCCGGTTCCTGCG GCACCGGCGGGCCACCCGGGCCGCGTCCGGGGAGGTGGCGGGAGCCCTGC GGGAGATGGTGGGCGCGGTGGGCGCGGTGCAGGCCGCCGCCGCCGAGCCG CAGGTCGCCGCGCACGTCGCCGGGCTGAACGGCGCCCGTGCCGAAGCCGC GGTGCGGGAGGAGCTGTACGCCGTCGTCCAGCGCACGGTGATCGGCAACC CGGCCCCGATCGGGGTCGGCGTGGTGCTGCTGCTGGTCGCGGGGCGGATG GACGAGGGGACCTTCAGCGTCGGCGATCTCGCCCTGTTCGCCTTCTACCT GCAGATCCTGACCGAGGCCCTGGGGTCGATCGGCATGCTGTCCGTGCGGT TGCAGCGGGTCTCGGTGGCGCTCGGCCGGATCACCAACAACCTCGGCTGC CGGCTGCGGCGGTCCCTGGAGCGGGCCAGTCCGCCGATCGCGTCCGACGC GCCGGGAGGGACCGGCGAGGGGGCCGCCGCCCCGGACGCCGGGCCGGAGC CCGCCCCGCCCCTGCGGGAACTGGCCGTGCGCGGGCTGACGGCCCGCCAC CCCGGGGCGGGGCACGGCATAGAGGACGTGGACCTGGTGGTGGAGCGGCA CACCGTCACCGTGGTCACCGGCCGGGTCGGTTCCGGCAAGAGCACCCTGG TCCGGGCCGTCCTCGGACTGCTCCCGCACGAGCGGGGCACCGTGCTGTGG AACGGCGAACCGATCGCCGACCCCGCGTCGTTCCTGGTGGCGCCGCGCTG CGGGTACACCCCGCAGGTCCCGTGTCTGTTCAGCGGGACGGTGCGGGAGA ACGTCCTGCTGGGCCGGACGGCGCGGCCTTCGACGAGGCCGTGCGCCTC GCCGTGGCGGAGCCCGACCTGGCGGCGATGCAGGACGGCCCGGACACCGT GGTGGGCCCGCGGGGCCTGCGCCTCTCGGGCGGGCAGATCCAGCGGGTCG CGATCGCCCGCATGCTGGTCGGCGACCCCGAACTCGTGGTGCTGGACGAC GTCTCCAGTGCCCTGGACCCGGAGACCGAGCACCTGCTGTGGGAGAGGCT GCTGGACGGGACGCGGACCGTGCTCGCGGTCTCCCACCGGCCCGCTCTGC TGCGCGCGGCCGACCGCGTGGTGGTGCTCGAGGGCGGGCGGGTGGAGGCC TCGGGCACCTTCGAGGAGGTCATGGCGGTCTCCGCCGAGATGGGCCGGAT CTGGACGGGTGCGGGTCCGGGGGGCGGGGACGCCGGGCCCGCTCCGCAGA GCCCTCCCGCGGGGTGA (SEQ ID NO: 24)

TABLE 70 Amino Acid Sequence of moeD5                                                VLR GSARTYWTLTGLWVLLRAGTLVVGLLFQRLFDALGAGGGVWLIIALVAAI EAGRLFLQFGVMINRLEPRVQYGTTARLRHALLGSALRGSEVTARTSPGE SLRTVGEDVDETGFFVAWAPTNLAHWLFVAASVTVMMRIDAVVTGALLAL LVLLTLVTALAHSRFLRHRRATRAASGEVAGALREMVGAVGAVQAAAAEP QVAAHVAGLNGARAEAAVREELYAVVQRTVIGNPAPIGVGVVLLLVAGRM DEGTFSVGDLALFAFYLQILTEALGSIGMLSVRLQRVSVALGRITNNLGC RLRRSLERASPPIASDAPGGTGEGAAAPDAGPEPAPPLRELAVRGLTARH PGAGHGIEDVDLVVERHTVTVVTGRVGSGKSTLVRAVLGLLPHERGTVLW NGEPIADPASFLVAPRCGYTPQVPCLFSGTVRENVLLGRDGAAFDEAVRL AVAEPDLAAMQDGPDTVVGPRGLRLSGGQIQRVAIARMLVGDPELVVLDD VSSALDPETEHLLWERLLDGTRTVLAVSHRPALLRAADRVVVLEGGRVEA SGTFEEVMAVSAEMGRIWTGAGPGGGDAGPAPQSPPAG (SEQ ID NO: 47)

TABLE 71 Sequence Homology of moeD5 ref|YP_075256.1|ABC transporter ATP-binding protein [Symbiobacterium thermophilum IAM 14863] Length = 590 Score = 372 bits (956), Expect = 2e−101 Identities = 233/559 (41%), Positives = 313/559 (55%), Gaps = 22/559 (3%) SEQ ID NO: 222          SEQ ID NO: 223 Query 24 LVVGLLFQRLFDALGAGGGV----WLIIALVAAIEAGRLFLQFGVMINRLEPRVQYGTTA 79 +| ||| |  || |     |    | ||||+ |    |+             | Sbjct 33 VVPGLLTQAFFDRLTGAAPVALDPWAIIALLMAAAVARVAALAAGFFASATGRESMANLL 92 SEQ ID NO: 251                                     SEQ ID NO: 224 Query 80 RLRHALLGSALRGSEVTARTSPGESLRTVGEDV---DETGFFVAWAPTNLAHWLFVAASV 136 | |+ |              ||||+|  + +||   +||  |+      +    | | ++ Sbjct 93 R-RNVLERILEMPGAAALPESPGEALNRLRDDVLHAEETADFMLDV---VGQTTFAAVAL 148   SEQ ID NO: 234                                 SEQ ID NO: 225 Query 137 TVMMRIDAVVTGALLALLVLLTLVTALAHSRFLRHRRATRAASGEVAGALREMVGAVGAV 196 ++++|||| +|  +   | |+ +||     | |++|| +| |+  |   + |+  +| || Sbjct 149 SMLLRIDARLTVLVFLPLALVLVVTRAVGRRILQNRRWSREATARVTALIAELFASVQAV 208 Query 197 QAAAAEPQVAAHVAGLNGARAEAAVREELYAVVQRTVIGNPAPIGVGVVLLLVAGRMDEG 256 | | || +| ||+  ||  |  | | + |   |  ++  | + +| |++||| |  |  | Sbjct 209 QVAGAESRVVAHLRRLNDERRRAMVADRLLTQVLESIALNASSVGTGLILLLGARTMATG 268                                               SEQ ID NO: 226 Query 257 TFSVGDLALFAFYLQILTEALGSIGMLSVRLQRVSVALGRITNNL-GCRLRRSLERASPP 315  |+||+ ||| +||  + +     |      ++  ||  |+   | |    | | |   | Sbjct 269 QFTVGEFALFVYYLGYVADFTHFAGRWLALYRQAGVAKDRLLALLQGAPPTRLLRRTEIP 328 Query 316 IASDAPGGTGEGAAAPDAGPEPAPPLRELAVRGLTARHPGAGHGIEDVDLVVERHTVTVV 375 +    |         |+  | || |||||   ||| |+| +| ||| | | + | + ||+ Sbjct 329 LRGPVP-------VPPEPPPPPAEPLRELRTEGLTYRYPDSGRGIEGVSLTIPRGSFTVI 381              SEQ ID NO:227 Query 376 TGRVGSGKSTLVRAVLGLLPHERGTVLWNGEPIADPASFLVAPRCGYTPQVPCLFSGTVR 435  |||||||+||+| ++|||| + ||| |||||+||| ||+| |||  ||||| |||||+ Sbjct 382 AGRVGSGKTTLLRVLMGLLPAQAGTVYWNGEPVADPGSFMVPPRCAATPQVPILFSGTLA 441            SEQ ID NO: 228 Query 436 ENVLLGRDG--AAFDEAVRLAVAEPDLAAMQDGPDTVVGPRGLRLSGGQIQRVAIARMLV 493 ||+ +| |   |    ||  || | ||| |+|| +| || ||+||||||+|| | ||| + Sbjct 442 ENIRMGLDATEAEVAAAVYDAVLERDLAGMEDGLETQVGARGVRLSGGQVQRTAAARMFL 501                              SEQ ID NO: 229 Query 494 GDPELVVLDDVSSALDPETEHLLWERLL-DGTRTVLAVSHRPALLRAADRVVVLEGGRVE 552   |||+++||+||||| ||| +|||||      | | |||| | || ||++++|+ ||| Sbjct 502 RRPELLIMDDLSSALDVETEQILWERLFRQRDVTCLVVSHREAALRRADQIILLKDGRVV 561 Query 553 ASGTFEEVMAVSAEMGRIW 571   || +|++| ||||  +| Sbjct 562 DRGTLDELLARSAEMRALW 580

TABLE 72 DNA Sequence of moeJ5. CTGCGCGGTGAACGGACCGCCGTGGCGCTGCTCGCCCTCCTGGTCCCCGC GGGGATGGGGCTCCAGCTGGTGGCGCCCTACCTGCTGCGCGGATTCATCG ACGGGGCGCTCTCCGGCGACTCCCGGAAGACGCTGCTGGACCTCGCCGCC TGGTCCCTGGCGGCCGCCGTCGGGACGCTCGTGGTCACCGCGGGCACCGA GGCGCTGTCCTCACGGGTCGCCTGGCGCAGCACCAACCGGTTGCGCGCGG ACCTGGTCGAGCACTGCCTGAGCCGGCCGCCGGGCTTCTACCGCAAGCAT CCGCCCGGCGAACTCGTCGAGCGGATGGACGGCGAGGTCACCCGGCTCGC CGCGGTGATGTCGACGCTGCTGCTGGAACTGCTGGCGCAGGCACTGCTGA TCGTCGGCATCCTCGTCGCCCTGTTCCGGCTGGAATGGCGGCTGGCCCTG GTGGTCGCCCCGTTCGCGGCAGGCACCCTCCTGCTGCTGCGGACCCTGGT GGGCCGCGCCATGCCCTTCGTCACCGCGCGGCAGCGGGTCGCGGCGGACC TGCAGGGCTTCCTCGAGGAGCGCCTCGCGGCGGCGGAGGACCTCCGCGTC AACGGGGCCTCGCGGTACACCCTGCGGGAACTCGGCGACCGGCAGGACGA CCTGTACCGGAAGGCCCGCGACGCGGCGCGGGCCTCGGTCCGCTGGCCCG CCACGGTGCAGGGCCTGTCCGCCGTCAGCGTCGTCCTGGCCCTGGCGGTC AGCGCCTGGCTGCACGCCCGCGGACAGCTCTCCACGGGGACGGCCTTCGC CTCCCTGTCCTACGCGATGCTGCTGCGCCGCCCCCTGCTCGCGGTCACCA CCCGCTTCCGCGAACTCGAGGACGCCGCCGCGAGCGCCCAGCGGCTGAGG GACCTGCTGGGCCACGGGACGGCGGCGCCCCGCACGGGACGCGGGACGCT GCCGGCCGGACTGCCCGGAGTCCGCTTCGACGGGGTCTCCTTCGGCTACG AGCCCGACGAGCCGGTGCTGCGGGACGTCTCCTTCACCCTGCGCCCCGGC GAACGCCTCGGCGTCGTGGGACGCACCGGCAGCGGCAAGTCCACCGTGGT CCGGCTGCTGTTCGGGCTCCACCACCCGGGGGCGGGCTCGGTGTCGGCAG GCGGCCTGGACCTGACGGAGATCGATCCCCGGGCGCTGCGCAGCCGGGTC GCGCTGGTCACCCAGGAGGTGCACGTCTTCCACGCCTCGCTGCGGGACAA CCTCACCTTCTTCGACCGCTCCGTCCCCGACGACCGGCTGCGCGCCGCTC TCGGCGAGGCCGGGCTCGGCCCCTGGCTGCGCACCCTGCCCGACGGTCTG GACACGCCGCTCGGCGCCGGGGCCCGCGGCATGTCCGCGGGCGAGGAGCA GCAGCTCGCGCTGGCCAGGGTGTTCCTGCGCGATCCGGGGCTGGTCCTGA TGGACGAGCCGACGGCCCGGCTGGATCCGTACAGCGAGCGGCTCCTGATG CCCGCGCTGGAGCGGCTGCTCGAGGGCCGCACCGCCGTCGTGGTGGAGCA CCGCCCGCACCTGCTCCGGAACGTCGACCGGATCCTGGTGCTGGAGGAGG GGAAGGTCGCCGAGGAGGGGGAGCGGAGGGTCCTCGCCGCCGATCCCGGG TCGCGCTTCCACGCACTCCTCCGCACGGCCGGAGCCACCCGGTGA (SEQ ID NO: 25)

TABLE 73 Amino Acid Sequence of moeJ5 LRGERTAVALLALLVPAGMGLQLVAPYLLRGFIDGALSGDSRKTLLDLAA WSLAAAVGTLVVTAGTEALSSRVAWRSTNRLRADLVEHCLSRPPGFYRKH PPGELVERMDGDVTRLAAVMSTLLLELLAQALLIVGTLVALFRLEWRLAL VVAPFAAGTLLLLRTLVGRAMPFVTARQRVAADLQGFLEERLAAAEDLRV NGASRYTLRELGDRQDDLYRKARDAARASVRWPATVQGLSAVSVVLALAV SAWLHARGQLSTGTAFASLSYAMLLRRPLLAVTTRFRELEDAAASAQRLR DLLGHGTAAPRTGRGTLPAGLPGVRFDGVSFGYEPDEPVLRDVSFTLRPG ERLGVVGRTGSGKSTVVRLLFGLHHPGAGSVSAGGLDLTETDPRALRSRV ALVTQEVHVFHASLRDNLTFFDRSVPDDRLRAALGEAGLGPWLRTLPDGL DTPLGAGARGMSAGEEQQLALARVFLRDPGLVLMDEPTARLDPYSERLLM PALERLLEGRTAVVVEHRPHLLRNVDRILVLEEGKVAEEGERRVLAADPG SRFHALLRTAGATR (SEQ ID NO: 48)

TABLE 74 Sequence Homology of moeJ5 gi|51892564|ref|YP_075255.1|ABC transporter ATP-binding protein [Symbiobacterium thermophilum IAM 14863] Length = 582 Score = 375 bits (963), Expect = 3e−102, Method: Composition-based stats. Identities = 249/545 (45%), Positives = 335/545 (61%), Gaps = 5/545 (0%) SEQ ID NO: 230     SEQ ID NO: 231 Query 19 MGLQLVAPYLLRGFID---GALSGDSRKTlldlaawslaaavgtlvvtagtEALSSRVAW 75 +||||+ | +|| |+|   | +||    +|+ ||   +  |     ||     ||  |+| Sbjct 34 IGLQLINPQILRRFLDTAAGEVSGGP--SLVTLALAFIGVAFAVQAVTVLARYLSESVSW 91 SEQ ID NO: 232              SEQ ID NO: 233 Query 76 RSTNRLRADLVEHCLSRPPGFYRKHPPGELVERMDGDVTRLAAVMSTlllellaqallIV 135 |+|| ||||| ||||    ||+++  |||+|||+||||| |+   | | + +||  +|++ Sbjct 92 RATNELRADLAEHCLRLDLGFHKRRTPGEMVERIDGDVTALSQFFSQLFIGVLANLVLML 151 Query 136 GILVALFRLEWRLALVVAPFaagtllllrtlvgrAMPFVTARQRVAADLQGFLEERLAAA 195 |||| ||| +||  + +  ||| || +|  +   ++|  | +++ +|+  |+| | |+ Sbjct 152 GILVLLFREDWRAGVAMTLFAAFTLWVLGRIHELSVPVWTRQRQASAEFYGYLGEVLSGT 211 Query 196 EDLRVNGASRYTLRELGDRQDDLYRKARDAARASVRWPATVQGlsavsvvlalavsaWLH 255 | +|  ||  + |        | ||+   |+       +|     ||   |+| | |||+ Sbjct 212 EAIRAGGARGWALHRFLRNVQDFYRQNLAASMMFWLTWSTSIVTFAVGAALSLGVGAWLY 271 Query 256 ARGQLSTGTAFASLSYAMLLRRPLLAVTTRFRELEDAAASAQRLRDLLGHGTAAPRTGRG 315 ||| ++ || +    |  |||||+  + |  +||+ | |+ +|+ +|    |  | Sbjct 272 ARGGVTVGTVYLLFHYTELLRRPIEQIRTHLQELQRAGAAVERVEELFAQRTRVPDGPGR 331 Query 316 TLPAGLPGVRFDGVSFGYEPDEPVLRDVSFTLRPGERLGVVGRTGSGKSTVVRLLFGLHH 375  || |   |   |||| |||  ||||||   + ||| +|++|||||||||+ |||   + Sbjct 332 ALPPGPLSVELVGVSFAYEPGAPVLRDVDVRIEPGEVVGLLGRTGSGKSTLARLLLRFYD 391 Query 376 PGAGSVSAGGLDLTEIDPRALRSRVALVTQEVHVFHASLRDNLTFFDRSVPDDRLRAALG 435 | || |  ||+|| |     +|+||  |||+| +|  ++|||||||   | | || | | Sbjct 392 PDAGIVRLGGVDLREATVAGVRARVGFVTQDVQLFAGTVRDNLTFFSPEVSDGRLLAVLE 451 Query 436 EAGLGPWLRTLPDGLDTPLGAGARGMSAGEEQQLALARVFLRDPGLVLMDEPTARLDPYS 495 | ||||||++|| |||||| +|  |+|||| | |||||||| |||||++|| ++|||| + Sbjct 452 ELGLGPWLQSLPQGLDTPLESGGGGLSAGEAQLLALARVFLADPGLVILDEASSRLDPAT 511 Query 496 ERLLMPALERLLEGRTAVVVEHRPHLLRNVDRILVLEEGKVAEEGERRVLAADPGSRFHA 555 | |+  |++||||||| +++ ||   +   | ||+||+|+| | | |  ||||| ||| Sbjct 512 ESLVERAVDRLLEGRTGIIIAHRLATVERADTILILEDGRVVEYGPRAELAADPASRFFR 571 Query 556 LLRTA 560 +|| Sbjct 572 MLRAG 576

Based on the studies of the present invention, it is propose that the polypeptide encoded by the moeD5 and moeJ5 genes are ABC transporters (Table 4).

G. Regulatory Genes

Genes governing the production of early building blocks of antibiotics are usually part of gene cluster for a given secondary metabolite. For example, NDP-hexoses, the first intermediates to many glycosylated antibiotics are produced from hexose-1-phosphate and NTP by dedicated nucleotidyltransferases encoded within gene clusters for biosynthesis of secondary metabolites (Kudo 2005, Luzhetskyy 2005, Murrell 2004). The association of genes for biosynthesis of isopentenyl pyrophosphate (IPP) and those for certain isoprene-derived antibiotics has also been recently demonstrated (Durr 2006, Kawasaki 2006, Dairi 2005). Genes for activated sugar and IPP biosynthesis were not identified within the moe clusters. This, along with the absence of dedicated regulatory moe genes (as yet, no traditional regulatory sequences were identified) poses intriguing questions about temporal control of enzyme “building blocks” apparently required for Moe A biosynthesis in S. ghanaensis.

It was found that the moeA5, moeO5, moeR5 and moeE5 genes contain codon “TTA” (2 were identified in moeE5). In the S. coelicolor A3(2) genome, only 145 genes out of 7825 possess TTA codons (Chater 2006).

The regulatory role of the TTA codons in genes for transcriptional activators of antibiotic production has been established (Rebets, 2006, Bibb 2005). For example, the gene for leucil tRNAUUA is expressed efficiently in late stationary phase of growth in S. coelicolor, thus exerting temporal control over the expression of antibiotic biosynthesis genes (Leskiw 1993). The moe clusters appears to lack a regulatory genes containing TTA codon(s), while the TAA condon is present in structural moe genes. It is likely that the different efficiency of codon TTA translation in different growth phases is an important mechanism of the temporal regulation of moe A production. However, mistranslation of TTA codons (Trepanier 2002) can negate the importance of such a suggested regulatory mechanism.

One of the unusual features of the moe clusters is the absence of pathway-specific regulatory genes. Rare TTA codons present in several moe genes were speculated to control the onset of moe A production (Ostash 2007). Mutations in the gene bidA, which encodes LeutRNAUUA, are known to affect many processes in streptomycetes, and antibiotic production is one of them (Chater 2006). It is logical to suppose that the regulatory effect of LeutRNAUUA is exerted at the translation level—i.e., that a scarcity of this tRNA until late in the cell cycle might limit the expression of TTA-containing genes. However, the pleiotropic nature of bidA mutants (Hodgson 2000), the presence of TTA codons mainly in regulatory genes and, finally, the mistranslation of TTA codons (Trepanier 2002) may lead to overinterpretation of the role of bldA gene. In this respect, moe cluster 1 is a rare case in which several TTA codons are located within structural genes, making it easier to link the bldA gene to antibiotic production.

To examine whether bidA is important for moe A biosynthesis, S. lividans J1725 carrying a mutated bldA gene (Leskiw 1991) was used. A bioassay of methanol extracts from the strains shows that J1725 coexpressing moeno38-1 and plasmid pUJ584, which carries the intact bidA gene, produced compound 19 (FIG. 14, spot 3) whereas a moeno38-1+ control strain carrying the empty vector pIJ303 did not (FIG. 14, spot 2). Evidently, the I gene regulates moe A production. Expression of pJ584 in the βmoeN5 strain did not, however, enhance the production of moe A intermediate 23, as judged by LC-MS and bioassays. This experiment suggests that increasing bidA expression above wild type levels does not produce a concomitant increase in antibiotic expression, implying that other factors limit the production of moenomycins. The identity of 19 produced in 38-b 1 ⁺pIJ584⁺J1725 strain was also confirmed by MS analysis (data not shown).

The wild-type S. ghanaensis strain produces moe A at a very low level, which complicates the analysis of metabolites produced by wild-type and recombinant S. ghanaensis strains. Thus, a means to increase moe expression is useful. The moe biosythesis-related genes of the invention are useful for the enrichment (i.e., overpression of moe and moe precursor molecules. In one embodiment, the invention provides a cell (prokaryotic or eukaryotic) which is genetically engineered to express one or more moe biosynthesis-related polypeptides encoded by the moe biosynthesis related genes of the invention as a means of expressing moe, a moe precursor molecule or chemical derivative thereof. In one embodiment the cell comprises a vector encoding one or more one or more moe biosynthesis-related genes of the invention. Methods useful for cloning and expression of bacterial genes in both prokaryotes and eukaryotes are well recognized in the art.

Regulatory genes (e.g., repressors, activators) may be identified by methods described above and by methods known in the art. As many bacterial genes involved in antibiotic production are expressed as operons (a cluster of structural genes with shared promoter or operator and terminator sequences), it is possible that a single protein or set of proteins may be responsible for the coordinated expression or repression of the entire moe cluster 1 and/or moe cluster 2 genes. Numerous such regulatory genes have already been identified in various antibiotic producing actinomycetes (e.g., redD involved in undecylprodigiosin biosynthesis regulation; actII-orf4 involved in actinorhodin biosynthesis regulation; dnrI involved in daunorubicin biosynthesis regulation; srmR involved in spiramycin biosynthesis regulation; strR involved in streptomycin biosynthesis regulation; ccaR involved in cepamycin and clavulanic acid biosynthesis regulation; mtmR, involved in mithramycin biosynthesis regulation). See Lombo, et al., 1999.

For example, an in silico search for homologues of such known bacterial repressors and activators, or functional regions of such known repressors and activators (e.g., DNA binding motiffs) may be performed. The putative regulators may then be disrupted and moe A production (e.g., the amount of moe A) may be evaluated in the mutant. An increase in moe A production in the mutant would indicate that the putative regulator was capable of acting as a repressor of moe A production, while a decrease in moe A production would indicate that the putative regulator was capable of acting as an activator of moe A production. Additionally or alternatively, the amount (e.g., the level of expression) of individual moe gene products (either RNA or protein) may be evaluated in the regulatory mutant.

To further test the function of a putative regulatory gene, the gene could be cloned by methods described above (e.g., PCR primers could be constructed which flank the gene sequence of interest; the sequence could then be amplified, isolated, and cloned into an appropriate expression or integration vector). The gene could then be introduced into a moe-producing strain and, if cloned in a high-copy number vector or cloned in front of a strong promoter (e.g., the ermE promoter), the effects of overexpression of the putative regulatory gene could be evaluated. For example, if the putative regulator was a repressor of moe A production, overexpression would lead to a decrease, or completely abolish, moe A production. If the putative regulator was an activator, then overexpression would likely lead to an increase in moe A production, and the subsequent development of moe A overexpressing strains. (See also section V.B, below). The cloned functional copy of the gene could also be introduced into the knockout mutant to verify gene function.

V. Characterization of Moe A Intermediates

A. moeGT3 Disruption

The plasmid for moeGT3 disruption pMO12 was transferred into S. ghanaensis ATCC14672 via conjugation and its homologous integration into the genome was promoted according to the described procedure (Ostash 2007). The site-specific integration of the pMO12 in the S. ghanaensis MO12 strain was confirmed via Southern analysis using DIG-labeled moeGT3 internal fragment as a probe (FIG. 15A). In the wild type strain, the moeGT3 gene resides in a 4.3-kb BamHI fragment whereas in the MO12 strain the corresponding hybridizing band is absent and a new 11-kb band is present. The latter corresponds to integration of a 7-kb pMO12 plasmid into the 4.5-kb BamHI moeGT3-containing fragment of the S. ghanaensis chromosome. Similarly, hybridization pattern of XhoI-digests of wild type and the mutant strain confirms the insertional inactivation of moeGT3 (FIGS. 15A, 15C). Introduction of plasmid pMO14 (P_(ermE)-moeGT3) into MO12 strain restored moe A production.

The purified cell extracts of MO12 showed strong antibacterial activity, implying that none of the steps essential for moe A pharmacophore formation is affected in the mutant (FIG. 15B). LC-MS analysis revealed that MO12 accumulated known moenomycin C₄. (MmC4) as a final product (FIG. 16). This conclusion was confirmed by high-resolution mass-spectral analysis (calculated mass of negative ion of MmC4: 1418.6012 Da, observed: 1418.6016 Da). Furthermore, the strain accumulated MmC4 precursor lacking chromophore unit (calculated mass for (M-H): 1322.5801 Da; observed: 1322.5796 Da). We also observed the di- and trisaccharide fragments of MmC4 (FIGS. 16A-16B), which could represent its intermediates or result from MmC4 fragmentation already in MS¹ experiments (such a phenomenon has been reported for moenomycins (Zehl 2006).

B. Analysis of S. lividans TK24 Strains Various Subsets of Moe Genes

We performed detailed LC-MS analysis of the mixtures of moenomycins produced by the recombinant S. lividans strains. This analysis allowed us to detect and isolate the final moe A-related metabolite produced by each strain for high-resolution MS analysis (see Table 21). Certain pure compounds were also studied by MS/MS and NMR. Results of these experiments as well as the pattern of intermediates/degradation products found in the extract of recombinant strains guided our predictions of the structures of novel moenomycins as it is described below. In all cases the observed masses coincide with calculated ones for compounds shown on FIGS. 4A-4D.

1. S. lividans ΔmoeN5 (Prenyltransferase Gene moeN5 Deletion in moeno38-1)

This strain accumulates two new closely related compounds 22/23 (FIGS. 4A-4D) not detected in the extracts of either empty heterologous host (TK24) or other recombinant S. lividans strains. The dramatically shifted Rt of 22/23 when comparing to moe A or compound 19 is an indication of shortened lipid chain. The analysis of ΔmoeN5 extract revealed the presence of intermediates to 22/23, namely the compounds 2, 3, 5, 8, 25 (FIGS. 17A-17B). The exact masses of negative ions of all aforementioned compounds coincide with calculated ones (compound 5—calculated: 943.3694 Da, observed: 943.3688 Da; 8—calculated: 986.3752 Da, observed: 943.3750 Da; 25—calculated: 1189.4546 Da, observed: 1189.4512 Da) and point to the presence of common polyprenyl chain of 15 carbons. Presence of 1072/1073 Da peak in MS² spectra of 22/23 (FIGS. 18A-18B) also witness that these compounds possess pentasaccharide-phosphoglyceric acid moiety found in other moenomycins (Ostash 2007, Zehl 2006). Compounds 22/23 show the biological activity (see next section). Taking these data together, we proposed the structures of 22/23 as shown below in Formula II:

R₂ may be either —OH (Compound 22) or —NH₂ (Compound 23).

We failed to restore the production of moenomycins having C25 isoprenoid chain (e.g. 19) in ΔmoeN5 strain using several constructs where moeN5 expression was driven from constitutive P_(ermE)* promoter. Moreover, we revealed that moeN5 overexpression even led to decrease in 22/23 production (data not shown), pointing to the existence of as-yet-unknown regulatory mechanism governing moeN5 expression under natural conditions. Therefore we constructed plasmid pOOB63a which contains both moeO5 and moeN5 genes along with 0.6 kb moeX5-moeO5 intergenic region. We assumed that the intergenic region contains promoter responsible for expression of moeO5moeN5 operon. Indeed, introduction of pOOB63a into ΔmoeN5 strain restored the production of 19. Plasmid pMoeO5extra (contains only moeO5 under P_(ermE*)) did not restore the production of 19 in ΔmoeN5 strain, meaning that only gene moeN5 is responsible for the restoration of 19 production when using plasmid pOOB63a.

B. Qualitative Analysis of Antibacterial Activity of Novel Moes

We examined the bioactivity of several purified moe A intermediates described above on a B. cereus reporter strain using a disk diffusion assay. The monosaccharide intermediates had no activity, while the moenocinol-linked penta- and tetrasaccharide compounds were roughly as active as moe A itself (FIG. 19). Disaccharide 4 could not be tested due to an extremely low production level and decomposition to 2/3. Compound 22/23, which features a C15 isoprenoid chain, showed antibacterial activity at submicromolar concentrations. Neryl-moenomycin was recently shown to be biologically inactive. Other moe derivatives having a farnesyl chain may show similar activity to compound 22/23. Therefore, in one embodiment, the present invention provides moe derivatives having the following general structure.

In one embodiment, the present invention provides a moenomycin derivative having the structure:

wherein

R and R¹ independently are selected from the group consisting of hydroxyl, and —NHR² where R² is hydrogen, alkyl, cycloalkyl, or substituted cycloalkyl;

X is hydrogen, or

where R³ is selected from the group consisting of hydrogen and hydroxyl; and

X¹ is hydrogen,

where R⁴ is selected from the group consisting of hydrogen and hydroxyl;

R⁵ is selected from the group consisting of hydroxyl, and —NHR⁶ where R⁶ is hydrogen, alkyl, cycloalkyl, or substituted cycloalkyl, and

R⁷ is hydrogen or methyl,

or a pharmaceutically acceptable salt, tautomer, and/or ester thereof.

In some embodiments, R and R¹ independently are —NH₂. In some embodiments, X is hydrogen. In other embodiments, X is

where R³ is selected from the group consisting of hydrogen and hydroxyl.

In some embodiments, X¹ is hydrogen. In other embodiments, X¹ is

In still other embodiments, X¹ is

where R⁴ is selected from the group consisting of hydrogen and hydroxyl and R⁵ is selected from the group consisting of hydroxyl, and —NH₂.

In some embodiments, the structure of the moenomycin derivative is:

where R³ is selected from the group consisting of hydrogen and hydroxyl,

or a pharmaceutically acceptable salt, tautomer, and/or ester thereof.

In some embodiments, R³ is hydrogen. In other embodiments, R³ is hydroxyl.

In some embodiments, the structure of the moenomycin derivative is:

where R⁴ is selected from the group consisting of hydrogen and hydroxyl,

or a pharmaceutically acceptable salt, tautomer, and/or ester thereof.

In some embodiments, R⁴ is hydrogen. In other embodiments, R⁴ is hydroxyl.

In some embodiments, the structure of the moenomycin derivative is:

where R⁴ is selected from the group consisting of hydrogen and hydroxyl,

or a pharmaceutically acceptable salt, tautomer, and/or ester thereof.

In some embodiments, R⁴ is hydrogen. In other embodiments, R⁴ is hydroxyl.

In some embodiments, the structure of the moenomycin derivative is:

where R⁴ is selected from the group consisting of hydrogen and hydroxyl,

or a pharmaceutically acceptable salt, tautomer, and/or ester thereof.

In some embodiments, R⁴ is hydrogen. In other embodiments, R⁴ is hydroxyl.

In some embodiments, the structure of the moenomycin derivative is:

where R⁴ is selected from the group consisting of hydrogen and hydroxyl,

or a pharmaceutically acceptable salt, tautomer, and/or ester thereof.

In some embodiments R⁴ is hydrogen. In other embodiments, R⁴ is hydroxyl.

In some embodiments, the structure of the moenomycin derivative is:

wherein R⁴ is hydrogen or hydroxyl and R⁶ is hydrogen, alkyl, cycloalkyl, or substituted cycloalkyl,

or a pharmaceutically acceptable salt, tautomer, and/or ester thereof.

In some embodiments, R⁴ is hydrogen. In other embodiments, R⁴ is hydroxyl.

In some embodiments, R⁶ is hydrogen or substituted cycloalkyl. In some preferred embodiments, substituted cycloalkyl is

In some embodiments, R⁴ is hydroxyl and R⁶ is

In some embodiments, pharmaceutical composition comprising the moenomycin derivative as defined above and a pharmaceutically acceptable carrier.

In another embodiment, the present invention provides a moenomycin derivative having the structure:

wherein

R⁷ and R⁸ independently are selected from the group consisting of hydroxyl, and —NHR⁹ where R⁹ is hydrogen, alkyl, cycloalkyl, or substituted cycloalkyl; and

R¹⁰ is hydrogen or hydroxyl;

or a pharmaceutically acceptable salt, tautomer, and/or ester thereof.

In some embodiments, R⁷ and R⁸ independently are —NH₂. In some embodiments, R¹⁰ is hydroxyl.

In some embodiments, the structure of the moenomycin derivative is:

or a pharmaceutically acceptable salt, tautomer, and/or ester thereof.

In some embodiments, pharmaceutical composition comprising the moenomycin derivative as defined above and a pharmaceutically acceptable carrier.

The moenomycin derivatives disclosed herein can be synthesized by genetic synthesis or by a conventional synthetic chemical synthesis by a person skilled in the art. The synthetic chemical synthesis can use known starting materials such as phosphoglycerate-farnesyl.

Perhaps the most surprising outcome of these studies, however, is that product 11 has biological activity (Welzel 2005) (FIG. 19). Compound 11 is shown below as Formula III:

Compound 11 does not contain the C ring, proposed to be part of the moe A pharmacophore, but instead contains a branching glucose unit. Evidently, either the C or the D ring can confer biological activity on the EF disaccharide, although the molecular basis for their effects is unclear. A crystal structure of moe A bound to a peptidoglycan glycosyltransferase domain has been reported and shows that the C ring binds in the active site cleft while the D ring protrudes from the cleft (Lowering 2007). The resolution of the complex is not sufficient to determine if there are specific contacts from conserved amino acids to the C ring, but it is almost certain that there are none to the D ring.

IV. Moe Assembly Scheme

The bioinformatic and genetic analysis of the identified moe genes suggests a scheme of moe A biosynthesis, depicted in FIGS. 4A-4D. Not wishing to be bound or limited by any theory, structures 23 and 3 witness that moe A biosynthesis starts with a unique reaction of farnesylation of phosphoglyceric acid. Sugars are transferred to the putative molecule 1P one by one. The carboxamide on unit F is required to proceed through the pathway; methylation, in contrast, can occur prior to the second glycosylation, but is not required to make the moe A pentasaccharide. Carbamoylation of the first sugar (unit F, FIG. 1) appears to happen only after attachment of three sugars since we could not detect any mass-peaks corresponding to carbamoylated disaccharide precursors (FIGS. 4A-4D). Prenylation to form the “mature” C25 isoprenoid chain also appears to occur only after the attachment of three sugars. The sequence of carbamoyltransferase and prenyltransferase reactions shown in FIGS. 4A-4D is proposed, however, the reverse order cannot be excluded. However, it is evident from the production of des-carbamoylated MmA pentasaccharides in a ΔmoeM5 strain and C15 MmA pentasaccharides in a ΔmoeN5 strain that carbamoylation is not required prenylation and prenylation is not required for carbamoylation. The attachment of the two remaining sugars and the chromophore completes moe A biosynthesis. Thus, the overall pathway has been delineated, although there remain questions about particular transformations, including the unusual MoeN5-catalyzed prenyl transfer reaction to generate the irregular C25 isoprenoid chain of MmA and the biochemistry of A ring biogenesis.

In one aspect of the present invention, the biosynthetic pathway for moe A may be altered for the production of phosphoglycolipid analogs. The biosynthetic pathway involves approximately ten essential (for biological activity) structural genes. The present inventors have discovered that a complex mixture of related compounds arises because the moenomycin biosynthetic machinery is flexible. Except for unit F carboxyamidation, all examined sugar tailoring reactions can be “switched off” without adverse effects on the assembly of the prenyl-phosphoglycerate-glycoside scaffold. Prenyl transfer to form the C25 lipid from the C15 precursor can also be switched off. Moreover, both MoeGT4 and MoeGT5 can accept either UDP-GlcNAc or UDP-chinovosamine as donor substrates, as evident from production of pholipomycin and moenomycin C₃. This, in combination with unbalanced expression of certain moe genes, leads to an interesting phenomenon where relatively simple pathway yields not one compound but a mixture of related ones. Thus, the spectrum of moenomycin metabolites and deriviates may be altered by selective deletion/overexpression of certain genes.

In some embodiments, the present invention provides for genetic manipulations of the moe A pathway for the discovery and production of clinically valuable molecules. Consistent with this expectation, we have already yielded several unexpected bioactive compounds. For example, we have found that the farnesylated moe A analog 23 has biological activity at submicromolar concentrations in a disk diffusion assay. This or other C15 derivatives may have better pharmacokinetic properties than the parent compounds, which would compensate for the evident decrease in potency. We have also shown that trisaccharide 11 is biologically active, providing an alternative scaffold for combinatorial or chemoenzymatic explorations to generate analogs.

VI. Further Examples of the Present Invention

A. Inactivation of Either Individual Moe Genes or Sets of Moe Genes to Generate Moe Derivatives, Analogs, Fragments and Novel Compounds

The above description of the insertional inactivation is one possible model for how disruptions of separate moe genes leads to the generation of bacterial strain with altered profile of moes production. The generated recombinant strains may be the source of novel moes which possess better antibacterial and pharmacological properties. These novel moes may further be chemically or chemoenzymaticaly modified to produce novel compounds. For example, the 406 bp internal fragment of gene moeC4 (corresponding to amino acids 146-281 of moeC4) for 5-aminolevulinate synthase may be amplified with primers alsupHindIII and alsrp1EcoRI and cloned into pKC1139. Following the procedure utilized for moeM5 gene disruption, a S. ghanaensis strain with a deficient aminolevulinate synthase gene may be generated. The strain would then accumulate a late moe intermediate lacking the chromophore unit. This simplified moe derivative would likely have slightly reduced antibacterial activity. Additionally, its acid or amide functionality at unit B could be further modified using standard chemical techniques or chemoenzymatic approaches (which will be described in following examples) to give a compound with improved properties.

For example, a more chemically reactive unit can be attached to the aforementioned simplified moe instead of the natural C5N chromophore. By strengthening the interactions between the chromophore-saccharide portion of moe and its target (transglycosylase involved in bacterial peptidoglycan biosynthesis) it might be possible to then remove/shorten the lipid chain from moe without the loss of its biological activity.

A combination of certain mutations in individual moe genes also presents options for the production of moe analogs. For example, combining the mutations in the methyltransferase gene moeK5, the sugar tailoring gene moeR5 and the moeC4 gene, a novel moe could be produced lacking unitA (chromophore), methyl group in ring F, and bearing a hydroxyl in the C6 position of unit C. S. ghanaensis produces mixture of moes, and some of them are products of branches of the main moe pathway. Thus, the disruptions of certain moe genes can be used to block the shunt moe pathways.

B. Overexpression of Moe Genes in Streptomyces ghanaensis to Generate Moe Overproducing Strains

The amplification of gene clusters for antibiotic biosynthesis in genomes of producing strains is known to lead to overproduction of the antibiotics. The availability of moe genes paves a similar path for the rational generation of moe overproducing strains. For example, protoplasts of wild-type S. ghanaensis strain (obtained according to the described procedure) may be transformed with alkali denaturated cosmid moeno38 (as described in Oh 1997) carrying most of the moe genes. The protoplasts may be regenerated on R2YE medium and selected for kanamycin resistance in order to isolate S. ghanaensis clones in which cosmid moeno38 has integrated into the host chromosome by homologous recombination. The production of moes may then be studied as described above to verify the overproducing phenotype. In this way the duplication of moe genes can be achieved.

In order to obtain S. ghanaensis strains with multiple copies of moe gene cluster per genome, cosmid moeno38-1 from cosmid moeno38, in which the neo gene marker has been replaced with 5.3 kb hyg-oriT^(RK2)-int^(φC31) PCR fragment of vector POOB40 by suing the λ-Red recombination system (Gust 2003). The construct could be transferred into S. ghanaensis and recombinant colonies could be selected for hygromycin resistance. The production of moes could then be studied as described above. Sometimes the expression of only certain genes may be a limiting factor in biosynthesis of a given secondary metabolite. Therefore, the expression of individual moe genes could be useful to increase the production of moes or moe-related compound.

C. Heterologous Expression of Moe Genes

The heterologous expression of moe genes may be used to for a number of reasons such as 1) to generate recombinant strains which produce more moes than, for example, S. ghanaensis, 2) to produce moe derivatives, 3) to generate novel compounds resulting from the modification of metabolites other than moes with Moe proteins. The moe genes of the present invention may be expressed in a variety of cell expression systems known in the art. By way of example, but not by way of limitation, bacterial (e.g., Streptomyces sp., E. coli), mammalian (e.g., mouse, human, rat, hamster, etc., such as NIH-3T3, HeLa, HEK 293, etc.), yeast (e.g., Saccharomyces cerevisiae, Pichia pastoris) and insect cells (e.g., Drosophila melanogaster Schneider cells) may be used. Numerous expression systems, including appropriate expression vectors, cell growth media and conditions, and cell lines, are known in the art and many are commercially available.

As one example, the moeno38-1 construction (which was transferred to S. lividans TK24, described above) may be transferred to other organisms, e.g. S. coelicolor M145, by applying the conjugation protocol, also described above. The transconjugants may be selected for hygromycin. The production of moes may be checked by biochromatography and LC-MS analysis. The recombinant strains may be better producers of moes than the wild type S. ghanaensis strain. Further, by using qp-Red recombination system, for example, modified moeno38 cosmids with deletions of one or more moe genes may be obtained. Their expression in a heterologous host (as described above) would lead to production of moes analogs.

The expression of sugar tailoring moe genes in the producers of other glycoside antibiotics may lead to production of novel hybrid compounds. For example, a given sugar tailoring gene may be cloned into a Streptomyces expression vector (such as pMKI9 or analogs). The resulting construct may then be transferred into a strain of interest via intergeneric conjugation, protoplast transformation or electroporation (Kieser 2000). The recombinant strain could then be subject to detailed analysis in order to detect the changes in antibiotic production.

D. Overexpression of Moe Proteins in E. coli or Streptomyces and their Use as Catalysts for Generation of Novel Molecules

Purified enzymes encoded by moe genes can be used to produce novel molecules. This approach is based on certain degree of substrate promiscuity of the enzymes involved in antibiotic production. For example, the gene for moenocinol 3-phosphoglycerate synthase may be amplified from cosmid moeno38 with primers moeO5HindIIIrp (AAAAAGCTTCCGCCCGCTCCCCGGAC; SEQ ID NO. 105) and moeO5NdeIup (AAACATATGCTCGCCCGGCTGCGC; SEQ ID NO. 106), resulting in an approximately 774 bp fragment. The amplification product may then be digested with HindIII and NdeI restriction endonucleases and cloned into respective sites of E. coli protein expression vector, such as pET24b (or any analogous vector that allows for affinity column protein purification). The resulting plasmid may then be introduced in an E. coli strain such as BL21(DE3) (or its derivatives that utilize similar strategy for induction of protein expression) using standard methods.

Alternatively, the moeO5 gene may be amplified along with its native ribosome binding site and a hexahistidine tag; desired restrictions sites may be engineered at the ends of the gene via PCR. This recombinant moeO5 gene can then be cloned into corresponding restrictions sites of a vector such as pMK19. The resulting plasmid can be introduced into S. ghanaensis using the above-described procedures. His-tagged moeO5 protein may then be purified from S. ghanaensis using standard IMAC chromatography.

The optimal conditions for moeO5 protein expression and purification from E. coli or Streptomyces can be developed experimentally; such methods and optimizations are well known to those skilled in the art. The pure moeO5 protein may then be used in vitro to, for example, reconstitute the reaction of prenyl transfer onto 3-phosphoglycerate. The ability of moeO5 to transfer unnatural prenyl chains onto 3-phosphoglycerate may also be exploited. The products of the reaction may be monitored by HPLC-MS. Since one of the disadvantage of moe A as a clinically valuable antibiotic is its poor pharmacokinetics, due in part to the long prenyl chain, the ability to produce the moes with altered prenyl chain length/stereochemistry would be a valuable step towards improved moes.

The above-described example provides an exemplary experimental framework for development of such technology for the production of improved moes. Additionally, if the moeO5 protein appears unable to transfer prenyl chains of shorter length/different stereochemistry, then it can be subjected to directed mutagenesis to relax/change its substrate specificity (see following example). The same approach as described above for moeO5 protein may be applied to the expression of any of moe proteins; that is, their ability to catalyze novel chemical reactions can be studied using a variety of different substrates and mutagenesis may be employed to change substrate specificity.

E. Site Specific Mutagenesis of Moe Genes in Order to Generate the Mutated Moe Proteins with Novel Enzymatic Activities.

Although the enzymes involved in antibiotic production are usually able to accept unnatural substrates resembling the natural one, their inherent level of substrate promiscuity might not be sufficient to produce a different compound. Accordingly, changes to specific amino acids which are identified as important for substrate recognition/reaction catalysis can be mutated within the given protein to generate highly efficient catalyst of novel reaction. Extracts from cells expressing the mutant moe biosynthesis-related gene (i.e. a “test strain”), may be assessed for anti-microbial activity using for example, a zone inhibition assay, such as depicted in FIG. 19. Comparison may be made between extracts from the test strain and extracts from a control strain (e.g. a strain expressing the natural moe biosynthesis-related genes). Test strains that have a larger zone of inhibition compared to the control strain may possess novel moes or moe derivatives. Test strains that have a smaller zone of inhibition compared to the control strain may lack novel moes or moe deriviatives. This approach depends on detailed structural information about the protein or, at least, its functional homologues. For example, the crystal structure of moeO5 homologue, GGGPS from Archaeoglobus fulgidus exists, as does information about kinetics of the reactions catalyzed by this class of enzymes. Using protein fold prediction programs, the structure of moeO5 may be modeled using A. fulgidus GGGPS as a template. In this way, amino acids critical for substrate recognition and binding may be identified within moeO5 and mutated to alter enzyme activity, for example, to help a catalytic pocket accommodate an altered or unnatural substrate. Alternatively, the entire polypeptide or portions of the polypeptide may be mutated at random sites to create mutant libraries (Lehtovaara et al., Protein Eng. 2: 63-8 (1988)). The mutants created in this way may also be tested, using, for example, the zone of inhibition assay described above.

F. Use of DNA Fragments of Moe Genes as a Hybridization Probes for Discovery of Genes Governing the Biosynthesis of Related Phosphoglycolipid Antibiotics

The moe genes of the present invention may be used to identify genes involved in the biosynthesis of other antibiotics in bacteria or other organisms. For example, the entire moeO5 gene (described above) may be labeled using non-radioactive digoxigenin or a radioactive approach (according to standard procedures) (Sambrook 1989). Genomic DNA of producers of other phosphoglycolipid antibiotics (e.g., AC326alpha, teichomycin, prasinomycin etc) may be isolated, digested with certain restriction endonucleases and separated on an agarose gel according to standard procedures. Southern analysis may then be used to identify any moeO5S homologues (i.e., the presence of positive hybridization signals using the labeled moeO5 gene with the genomic digests would demonstrate the presence of genes similar to moeO5). Therefore, the moe genes, such as moeO5, can be used to probe the genomic library of a given strain to identify, clone and characterize homologues and surrounding genes. Any other moe gene can be used in the same way as a probe for discovery of related genes in other producers.

G. Use of an Internal Fragment of Moe Genes and Homologous Recombination for Discovery of Genes Governing the Biosynthesis of Related Phosphoglycolipid Antibiotics

The carbamoyltransferase moeM5 disruption plasmid pOOB20a (described above) may be transferred into Actinoplanes teichomiceticus, the producer of teichomycins, via standard intergeneric conjugation protocol. The integration of pOOB20a into the A. teichomyceticus genome may be promoted as described for the moeM5 disruption in S. ghanaensis ATCC14672 strain. If there is a gene in the A. teichomyceticus genome which shows even moderate homology to moeM5 (50-70% at nucleotide level), then homologous recombination could occur between the moeM5 fragment on plasmid pOOB20a and the similar gene present in the host's genome. The integration of pOOB20a into A. teichomyceticus genome could be verified through Southern analysis, and the production of teichomycins may be monitored using bioassays and HPLC-MS analysis. Homologues of moeM5 and surrounding sequences could be rescued from pOOB20a+ A. teichomyceticus integrant by digesting total DNA with restriction endonucleases which do not cut the pOOB20a plasmid (HindIII, XbaI, EcoRV). In this way pOOB20a can be excised from the A. teichomyceticus genome along with some genomic flanking sequence (the amount of flanking sequence obtained would depend on how far the restriction sites were from the pOOB20a insertion). These digests may be set-up to be self-ligating and then used to transform competent cells such as E. coli. The apramycin-resistant clones resulting from this transformation can then be used for the isolation of plasmid DNA and further restriction mapping. The fragments of A. teichomyceticus genome can also be subcloned into suitable vectors (e.g., pUC19, pBluescript etc.) for sequencing. The described procedure would be useful to identify homologues or genes that have sequence similarity to any of the moe biosynthesis-related genes of the invention.

H. Design of Degenerate Primers on the Basis of Moe Genes for Discovery of Novel Genes

One method to identify novel genes (having homology to the moe genes described above) may include designing degenerate primers capable of amplifying sequences similar to those of the identified moe genes. For example, to design such primers, the sequence of a moe gene, such as the moeO5 gene, may be aligned with sequences of known prenyl-glycerol synthases from other bacterial species, such as Archaea. The conserved amino acid residues may be identified and their significance assessed through comparison with the crystal structure of prenyl-glycerol synthases from other bacteria, such as A. fulgidus. Two stretches of amino acids within a conserved C-terminus of the compared proteins (e.g., 88-ADALLL-93 (SEQ ID NO: 252) and 190-GADYVG-195 (SEQ ID NO: 253)) can be back-translated into DNA sequence taking into account the codon usage of Streptomyces (if other organisms are targeted in this kind of experiment, then the choice of codons is planned according to codon usage table of that organism) and allowing for ambiguity in a third codon position. The program CODEHOP (Rose et al., Nucl Acids Res, 1998 26:1628-1635) can be used to design degenerate primers.

PCR conditions using such degenerate primers may be developed in the course of additional experimentation using positive controls (e.g., the template DNA of the moe producer). Such PCR optimizations are well known in the art. The PCR products from unknown strains may then be cloned using, for example, PCR LIC cloning vectors (Novagen, San Diego, Calif.) according to the manufacturer's instructions. The insert can be sequenced compared with and moeO5 and other known homologues.

Examples F, G and H all describe methods in which novel moe gene homologues can be identified and isolated. Because the methods are based on sequence similarity, they allow for the discovery of genes which may have different biochemical or functional characteristics than the moe genes used to find them. Such characteristics may include but are not limited to different substrate specificity (e.g., more or less specific for a particular substrate; specificity for a different, but similar substrate); a different reaction rate (e.g., much faster or much slower, thereby allowing other reactions or modifications to occur or not); the ability to function under different reaction conditions (e.g., modified salt, temperature or pH); improved or increased stability. Any of the discovered homologues with different characteristics could prove useful in an in vitro or in vivo moe A synthesis and/or modification system, in conjunction with, or in place of, one of the identified moe genes. Additionally, identification of such moe homologues may lead to the identification of different moe-like or other antibiotic biosynthetic pathway.

I. Generating and Testing Novel Moes

The present invention provides for the generating of novel moe or moe derivatives which are capable of inhibiting the activity of bacterial transglycoslase enzymes. The novel moes may be generated according to the methods described herein, including the expression of moe biosynthesis-related genes, or fragments or variants thereof, in S. ghanaensis or a heterologous host, e.g. S. lividans. Extracts from cells expressing the moe biosynthesis-related genes, fragments, or variants thereof (i.e. a “test strain”), may be assessed for anti-microbial activity using for example, a zone inhibition assay, such as depicted in FIG. 19. Comparison may be made between extracts from the test strain and extracts from a control strain (e.g. a strain expressing the natural moe biosynthesis-related genes). Test strains that have a larger zone of inhibition compared to the control strain may possess novel biologically-active moes or moe derivatives. Test strains that have a smaller zone of inhibition compared to the control strain may lack novel moes or moe deriviatives.

Novel moes or moe derivatives might also be identified using LC-MS analysis of extracts from cells expressing the moe biosynthesis-related genes, fragments, or derivatives thereof. Comparison may be made between the LC-MS spectra of the test strain and a control strain. The appearance of new peaks may indicate the presence of novel moes or moe derivatives in that test strain. Those compounds may be purified using methods known to those of skill in the art (e.g. chromatography). The anti-microbial activity of those compounds can then be assayed using, for example, a zone of inhibition assay.

J. Improved Moe A Characteristics

In some embodiments, the modified and improved moe A formulations of the present invention (“improved moe A”) are contemplated to exhibit increased bioavailability as compared to the currently available, or conventional moe A formulations such as Flavomycin®. The increased bioavailability is also likely to result in a dosage form that exhibits greater drug absorption than conventional formulations of moe A; as such, a pharmaceutically acceptable formulation for use in humans is contemplated.

Increased bioavailability can be ascertained by methods known in the art. For example, the drug absorption, distribution, and/or elimination rates may be evaluated and compared to conventional moe A, as may the different pharmacokinetic profiles. Exemplary, desirable pharmacokinetic profiles preferably include, but are not limited to: (1) a C_(max) for an improved moe, such as an improved moe A, or a derivative or salt thereof, when assayed in the plasma of a mammalian subject following administration, that is preferably greater than the C_(max) for the conventional moe A administered at the same dosage; and/or (2) an AUC for an improved moe, such as an improved moe A, or a derivative or a salt thereof, when assayed in the plasma of a mammalian subject following administration, that is preferably greater than the AUC for conventional moe A, administered at the same dosage; and/or (3) a T_(max) for an improved moe, such as moe A, or a derivative or a salt thereof, when assayed in the plasma of a mammalian subject following administration, that is preferably less than the T_(max) for a conventional formulation moe A, administered at the same dosage. The desirable pharmacokinetic profile, as used herein, is the pharmacokinetic profile measured after the initial dose of the moe A or derivative or a salt thereof.

For example, in one embodiment, a composition comprising at least one improved formulation of moe A exhibits in comparative pharmacokinetic testing with a non-improved formulation of the same moe A (e.g. Flavomycin), administered at the same dosage, a T_(max) not greater than about 90%, not greater than about 80%, not greater than about 70%, not greater than about 60%, not greater than about 50%, not greater than about 30%, not greater than about 25%, not greater than about 20%, not greater than about 15%, not greater than about 10%, or not greater than about 5% of the T_(max) exhibited by the conventional moe A formulation.

In another embodiment, the composition comprising at least one improved moe A formulation or derivative or salt thereof, exhibits in comparative pharmacokinetic testing with a conventional moe A formulation (e.g., Flavomycin), administered at the same dosage, a C_(max) which is at least about 50%, at least about 100%0, at least about 200%, at least about 300%, at least about 400%, at least about 500%, at least about 600%, at least about 700%, at least about 800%, at least about 900%, at least about 1000%, at least about 1100%, at least about 1200%, at least about 1300%, at least about 14000%, at least about 1500%, at least about 1600%, at least about 1700%, at least about 1800%, or at least about 1900% greater than the C_(max) exhibited by the conventional moe A formulation.

In yet another embodiment, the composition comprising at least one improved moe A or a derivative or salt thereof, exhibits in comparative pharmacokinetic testing with a conventional formulation of moe A (e.g., Flavomycin), administered at the same dosage, an AUC which is at least about 25%, at least about 50%, at least about 75%, at least about 100%, at least about 125%, at least about 150%, at least about 175%, at least about 200%, at least about 225%, at least about 250%, at least about 275%, at least about 300%, at least about 350%, at least about 400%, at least about 450%, at least about 500%, at least about 550%, at least about 600%, at least about 750%, at least about 700%, at least about 750%, at least about 800%, at least about 850%, at least about 900%, at least about 950%, at least about 1000%, at least about 1050%, at least about 1100%, at least about 1150%, or at least about 1200% greater than the AUC exhibited by the conventional moe A formulation.

The moe A formulations contemplated also include a variety of pharmaceutical acceptable dosage forms. By way of example, but not by way of limitation, pharmaceutically acceptable formulations may include: formulation for oral, pulmonary, intravenous, rectal, ophthalmic, colonic, parenteral, intracisternal, intravaginal, intraperitoneal, local, buccal, nasal, and topical administration; dosage forms such as liquid dispersions, gels, aerosols, ointments, creams, tablets, sachets and capsules; dosage forms such as lyophilized formulations, fast melt formulations, controlled release formulations, delayed release formulations, extended release formulations, pulsatile release formulations, and mixed immediate release and controlled release formulations, or any combination of the above. In some embodiments, preferred formulations for administration may include oral tablets or capsules. In other embodiments, parenteral formulations may be preferred.

K. Prophylactic and Therapeutic Use of Moe A Derivatives

General.

The moe A derivatives and intermediates of the present invention can be used in treatment of bacterial infections. Specifically, the invention provides for both prophylactic and therapeutic methods of treating a subject at risk of (or susceptible to) a disorder or having a disorder associated with an bacterial infections. While not wishing to be limited by theory, administration of moe A results in inhibition of bacterial transglycosylase enzymes and killing or slowing the growth of the bacteria.

In one aspect, the invention provides a method for preventing, in a subject, a disease or condition associated with a bacterial infection, by administering to the subject a moe A derivative. Administration of a prophylactic moe A derivative can occur prior to the manifestation of symptoms characteristic of the infection, such that a disease or condition is prevented or, alternatively, delayed in its progression. In therapeutic applications, moe A derivatives are administered to a subject suspected of, or already suffering from, a bacterial infection. An amount adequate to accomplish therapeutic or prophylactic treatment is defined as a therapeutically- or prophylactically-effective dose.

Determination of the Biological Effect of a Moe A Derivative Therapeutic.

In various embodiments of the invention, suitable in vitro or in vivo assays are performed to determine the effect of moe A derivatives and whether the administration is indicated for treatment of the affected tissue in a subject.

Typically, an effective amount of the compositions of the present invention, sufficient for achieving a therapeutic or prophylactic effect, range from about 0.000001 mg per kilogram body weight per day to about 10,000 mg per kilogram body weight per day. Preferably, the dosage ranges are from about 0.0001 mg per kilogram body weight per day to about 100 mg per kilogram body weight per day. For example dosages can be 1 mg/kg body weight or 10 mg/kg body weight every week, every two weeks or every three weeks or within the range of 1-10 mg/kg every week, every two weeks or every three weeks. In one embodiment, a single dosage of antibody range from 0.1-10,000 micrograms per kg body weight. In one embodiment, antibody concentrations in a carrier range from 0.2 to 2000 micrograms per delivered milliliter. An exemplary treatment regime entails administration once per every two weeks or once a month or once every 3 to 6 months. Alternatively, moe A derivatives can be administered as a sustained release formulation, in which case less frequent administration is required. Dosage and frequency vary depending on the half-life of the moe A derivative in the subject. The dosage and frequency of administration can vary depending on whether the treatment is prophylactic or therapeutic. In prophylactic applications, a relatively low dosage is administered at relatively infrequent intervals over a long period of time. Some subjects continue to receive treatment for the rest of their lives. In therapeutic applications, a relatively high dosage at relatively short intervals is sometimes required until progression of the disease is reduced or terminated, and preferably until the subject shows partial or complete amelioration of symptoms of disease. Thereafter, the patent can be administered a prophylactic regime.

Toxicity.

Preferably, an effective amount (e.g., dose) of a moe A derivative described herein will provide therapeutic benefit without causing substantial toxicity to the subject. Toxicity of the moe A derivative described herein can be determined by standard pharmaceutical procedures in cell cultures or experimental animals, e.g., by determining the LD₅₀ (the dose lethal to 50% of the population) or the LD₁₀₀ (the dose lethal to 100% of the population). The dose ratio between toxic and therapeutic effect is the therapeutic index. The data obtained from these cell culture assays and animal studies can be used in formulating a dosage range that is not toxic for use in human. The dosage can vary within this range depending upon the dosage form employed and the route of administration utilized. The exact formulation, route of administration and dosage can be chosen by the individual physician in view of the subject's condition. See, e.g., Fingl et al., In: The Pharmacological Basis of Therapeutics, Ch. 1 (1975).

Formulations of Pharmaceutical Compositions.

According to the methods of the present invention, the moe A derivative can be incorporated into pharmaceutical compositions suitable for administration. Pharmaceutically-acceptable carriers are determined in part by the particular composition being administered, as well as by the particular method used to administer the composition. Accordingly, there is a wide variety of suitable formulations of pharmaceutical compositions for administering the antibody compositions (see, e.g., Remington's Pharmaceutical Sciences, Mack Publishing Co., Easton, Pa. 18^(th) ed., 1990). The pharmaceutical compositions are generally formulated as sterile, substantially isotonic and in full compliance with all Good Manufacturing Practice (GMP) regulations of the U.S. Food and Drug Administration.

The terms “pharmaceutically-acceptable,” “physiologically-tolerable,” and grammatical variations thereof, as they refer to compositions, carriers, diluents and reagents, are used interchangeably and represent that the materials are capable of administration to or upon a subject without the production of undesirable physiological effects to a degree that would prohibit administration of the composition. For example, “pharmaceutically-acceptable excipient” means an excipient that is useful in preparing a pharmaceutical composition that is generally safe, non-toxic, and desirable, and includes excipients that are acceptable for veterinary use as well as for human pharmaceutical use. Such excipients can be solid, liquid, semisolid, or, in the case of an aerosol composition, gaseous. “Pharmaceutically-acceptable salts and esters” means salts and esters that are pharmaceutically-acceptable and have the desired pharmacological properties. Such salts include salts that can be formed where acidic protons present in the moe A derivative are capable of reacting with inorganic or organic bases. Suitable inorganic salts include those formed with the alkali metals, e.g., sodium and potassium, magnesium, calcium, and aluminum. Suitable organic salts include those formed with organic bases such as the amine bases, e.g., ethanolamine, diethanolamine, triethanolamine, tromethamine, N-methylglucamine, and the like. Such salts also include acid addition salts formed with inorganic acids (e.g., hydrochloric and hydrobromic acids) and organic acids (e.g., acetic acid, citric acid, maleic acid, and the alkane- and arene-sulfonic acids such as methanesulfonic acid and benzenesulfonic acid). Pharmaceutically-acceptable esters include esters formed from carboxy, sulfonyloxy, and phosphonoxy groups present in the moe A derivative, e.g., C₁₋₆ alkyl esters. When there are two acidic groups present, a pharmaceutically-acceptable salt or ester can be a mono-acid-mono-salt or ester or a di-salt or ester, and similarly where there are more than two acidic groups present, some or all of such groups can be salified or esterified. The moe A derivative named in this invention can be present in unsalified or unesterified form, or in salified and/or esterified form, and the naming of such moe A derivative is intended to include both the original (unsalified and unesterified) compound and its pharmaceutically-acceptable salts and esters. Also, certain moe A derivatives named in this invention can be present in more than one stereoisomeric form, and the naming of such moe A derivatives is intended to include all single stereoisomers and all mixtures (whether racemic or otherwise) of such stereoisomers. A person of ordinary skill in the art, would have no difficulty determining the appropriate timing, sequence and dosages of administration for particular drugs and compositions of the present invention.

Preferred examples of such carriers or diluents include, but are not limited to, water, saline, Ringer's solutions, dextrose solution, and 5% human serum albumin. Liposomes and non-aqueous vehicles such as fixed oils may also be used. The use of such media and compounds for pharmaceutically active substances is well known in the art. Except insofar as any conventional media or compound is incompatible with the moe A derivative, use thereof in the compositions is contemplated. Supplementary active compounds can also be incorporated into the compositions.

A pharmaceutical composition of the invention is formulated to be compatible with its intended route of administration. The moe A derivative compositions of the present invention can be administered by parenteral, topical, intravenous, oral, subcutaneous, intraarterial, intradermal, transdermal, rectal, intracranial, intraperitoneal, intranasal; intramuscular route or as inhalants. The moe A derivative can optionally be administered in combination with other agents that are at least partly effective in treating conditions associated with bacterial infection.

Solutions or suspensions used for parenteral, intradermal, or subcutaneous application can include the following components: a sterile diluent such as water for injection, saline solution, fixed oils, polyethylene glycols, glycerine, propylene glycol or other synthetic solvents; antibacterial compounds such as benzyl alcohol or methyl parabens; antioxidants such as ascorbic acid or sodium bisulfite; chelating compounds such as ethylenediaminetetraacetic acid (EDTA); buffers such as acetates, citrates or phosphates, and compounds for the adjustment of tonicity such as sodium chloride or dextrose. The pH can be adjusted with acids or bases, such as hydrochloric acid or sodium hydroxide. The parenteral preparation can be enclosed in ampoules, disposable syringes or multiple dose vials made of glass or plastic.

Pharmaceutical compositions suitable for injectable use include sterile aqueous solutions (where water soluble) or dispersions and sterile powders for the extemporaneous preparation of sterile injectable solutions or dispersion. For intravenous administration, suitable carriers include physiological saline, bacteriostatic water, Cremophor ELTM (BASF, Parsippany, N.J.) or phosphate buffered saline (PBS). In all cases, the composition must be sterile and should be fluid to the extent that easy syringeability exists. It must be stable under the conditions of manufacture and storage and must be preserved against the contaminating action of microorganisms such as bacteria and fungi. The carrier can be a solvent or dispersion medium containing, e.g., water, ethanol, polyol (e.g., glycerol, propylene glycol, and liquid polyethylene glycol, and the like), and suitable mixtures thereof. The proper fluidity can be maintained, e.g., by the use of a coating such as lecithin, by the maintenance of the required particle size in the case of dispersion and by the use of surfactants. Prevention of the action of microorganisms can be achieved by various antibacterial and antifungal compounds, e.g., parabens, chlorobutanol, phenol, ascorbic acid, thimerosal, and the like. In many cases, it will be preferable to include isotonic compounds, e.g., sugars, polyalcohols such as manitol, sorbitol, sodium chloride in the composition. Prolonged absorption of the injectable compositions can be brought about by including in the composition a compound which delays absorption, e.g., aluminum monostearate and gelatin.

Sterile injectable solutions can be prepared by incorporating the moe A derivative in the required amount in an appropriate solvent with one or a combination of ingredients enumerated above, as required, followed by filtered sterilization. Generally, dispersions are prepared by incorporating the binding agent into a sterile vehicle that contains a basic dispersion medium and the required other ingredients from those enumerated above. In the case of sterile powders for the preparation of sterile injectable solutions, methods of preparation are vacuum drying and freeze-drying that yields a powder of the active ingredient plus any additional desired ingredient from a previously sterile-filtered solution thereof. The agents of this invention can be administered in the form of a depot injection or implant preparation which can be formulated in such a manner as to permit a sustained or pulsatile release of the active ingredient.

Oral compositions generally include an inert diluent or an edible carrier. They can be enclosed in gelatin capsules or compressed into tablets. For the purpose of oral therapeutic administration, the binding agent can be incorporated with excipients and used in the form of tablets, troches, or capsules. Oral compositions can also be prepared using a fluid carrier for use as a mouthwash, wherein the compound in the fluid carrier is applied orally and swished and expectorated or swallowed. Pharmaceutically compatible binding compounds, and/or adjuvant materials can be included as part of the composition. The tablets, pills, capsules, troches and the like can contain any of the following ingredients, or compounds of a similar nature: a binder such as microcrystalline cellulose, gum tragacanth or gelatin; an excipient such as starch or lactose, a disintegrating compound such as alginic acid, Primogel, or corn starch; a lubricant such as magnesium stearate or Sterotes; a glidant such as colloidal silicon dioxide; a sweetening compound such as sucrose or saccharin; or a flavoring compound such as peppermint, methyl salicylate, or orange flavoring.

For administration by inhalation, the moe A derivative are delivered in the form of an aerosol spray from pressured container or dispenser which contains a suitable propellant, e.g., a gas such as carbon dioxide, or a nebulizer.

Systemic administration can also be by transmucosal or transdermal means. For transmucosal or transdermal administration, penetrants appropriate to the barrier to be permeated are used in the formulation. Such penetrants are generally known in the art, and include, e.g., for transmucosal administration, detergents, bile salts, and fusidic acid derivatives. Transmucosal administration can be accomplished through the use of nasal sprays or suppositories. For transdermal administration, the moe A derivative is formulated into ointments, salves, gels, or creams as generally known in the art.

The moe A derivative can also be prepared as pharmaceutical compositions in the form of suppositories (e.g., with conventional suppository bases such as cocoa butter and other glycerides) or retention enemas for rectal delivery.

In one embodiment, the moe A is prepared with carriers that will protect the moe A derivative against rapid elimination from the body, such as a controlled release formulation, including implants and microencapsulated delivery systems. Biodegradable, biocompatible polymers can be used, such as ethylene vinyl acetate, polyanhydrides, polyglycolic acid, collagen, polyorthoesters, and polylactic acid. Methods for preparation of such formulations will be apparent to those skilled in the art. The materials can also be obtained commercially from Alza Corporation and Nova Pharmaceuticals, Inc. Liposomal suspensions (including liposomes targeted to infected cells with monoclonal antibodies to viral antigens) can also be used as pharmaceutically-acceptable carriers. These can be prepared according to methods known to those skilled in the art, e.g., as described in U.S. Pat. No. 4,522,811.

It is especially advantageous to formulate oral or parenteral compositions in dosage unit form for ease of administration and uniformity of dosage. Dosage unit form as used herein refers to physically discrete units suited as unitary dosages for the subject to be treated; each unit containing a predetermined quantity of binding agent calculated to produce the desired therapeutic effect in association with the required pharmaceutical carrier. The specification for the dosage unit forms of the invention are dictated by and directly dependent on the unique characteristics of the compound and the particular therapeutic effect to be achieved, and the limitations inherent in the art of compounding such moe A derivatives for the treatment of a subject.

REFERENCES

-   Adachi, M., Zhang, Y., Leimkuhler, C., Sun, B., LaTour, J. V., and     Kahne, D. E. (2006), Degradation and reconstruction of moenomycin A     and derivatives: dissecting the function of the isoprenoid chain. J     Am Chem Soc. 128, 14012-14013. -   Arai M, Torikata A, Enokita R, Fukatsu H, Nakayama R and Yoshida K.     Pholipomycin, a new member of phosphoglycolipid antibiotics. I.     Taxonomy of producing organism and fermentation and isolation of     pholipomycin. J Antibiot. 1977. V. 30(12): 1049-1054. -   Baizman E R, Branstrom A A, Longley C B, Allanson N, Sofia M J,     Gange D, Goldman R C: Antibacterial activity of synthetic analogues     based on disaccharide structure of moe, an inhibitor of bacterial     transglycosylase. Microbiology 2000, 146: 3129-3140. -   Bardone M R, Paternoster M and Coronelli C. Teichomycins, new     antibiotics from Actinoplanes teichomyceticus nov sp. II. Extraction     and chemical characterization. J Antibiot. 1978. V. 31(3): 170-177. -   Belanger M, Burrows L L and Lam J S. Functional analysis of genes     responsible for the synthesis of the B-band O antigen of Pseudomonas     aeruginosa serotype 06 lipopolysaccharide. Microbiology. 1999. V.     145:3505-3521. -   Bentley S D, Chater K F, Cerdeno-Tarraga A M, Challis G L, Thomson N     R, James K D, Harris D E, Quail M A, Kieser H, Harper D, Bateman A,     Brown S, Chandra G, Chen C W, Collins M, Cronin A, Fraser A, Goble     A, Hidalgo J, Hornsby T, Howarth S, Huang C H, Kieser T, Larke L,     Murphy L, Oliver K, O'Neil S, Rabbinowitsch E, Rajandream M A,     Rutherford K, Rutter S, Seeger K, Saunders D, Sharp S, Squares R,     Squares S, Taylor K, Warren T, Wietzorrek A, Woodward J, Barrell B     G, Parkhill J, Hopwood D A. Complete genome sequence of the model     actinomycete Streptomyces coelicolor A3(2). Nature. 2002.     417:141-147. -   Bibb M J. Regulation of secondary metabolism in streptomycetes. Curr     Opin Microbiol. 2005. V. 8:208-215. -   Bierman M, Logan R, O'Brien K, Seno E T, Rao R N, Schoner B E.     Plasmid cloning vectors for the conjugal transfer of DNA from     Escherichia coli to Streptomyces spp. Gene. 1992. 116:43-49. -   Blondelet-Rouault, M. H., Weiser, J., Lebrihi, A., Branny, P., and     Pernodet, J. L. (1997). Antibiotic resistance gene cassettes derived     from the omega interposon for use in E. coli and Streptomyces. Gene     190, 315-317. -   Chaffin D O, McKinnon K and Rubens C E. CpsK of Streptococcus     agalactiae exhibits α2,3-sialyltransferase activity in Haemophilus     ducreyi. Mol Microbiol. 2002. V. 45(1):109-122. -   Chang G. Multidrug resistance ABC transporters. FEBS Lett. 2003 Nov.     27; 555(1):102-5. -   Chater K F. Streptomyces inside-out: a new perspective on the     bacteria that provide us with antibiotics. Phil Trans R     Soc B. 2006. V. 361:761-768. -   Chen L, Walker D, Sun B, Hu Y, Walker S, Kahne D: Vancomycin     analogues active against vanA-resistant strains inhibit bacterial     transglycosylase without binding substrate. Proc Natl Acad Sci USA     2003, 100: 5658-5663. -   Dairi T. Studies on biosynthetic genes and enzymes of isoprenoids     produced by actinomycetes. J Antibiot (Tokyo). 2005 April;     58(4):227-43. -   Datsenko, K. A., and Wanner, B. L. (2000). One-step inactivation of     chromosomal genes in Escherichia coli K-12 using PCR products. Proc.     Natl. Acad. Sci. USA 97, 6640-6645. -   Decker H, Gaisser S, Pelzer S, Schneider P, Westrich L, Wohlleben W,     Bechthold A. A general approach for cloning and characterizing     dNDP-glucose dehydratase genes from actinomycetes. FEMS Microbiol     Lett. 1996. V. 141: 195-201. -   Du, Y., Li, T., Wang, Y. G. and Xia, H. Identification and     Functional Analysis of dTDP-Glucose-4,6-Dehydratase Gene and Its     Linked Gene Cluster in an Aminoglycoside Antibiotics Producer of     Streptomyces tenebrarius H Curt. Microbiol. 49 (2), 99-107 (2004). -   Durr C, Schnell H-J, Luzhetskyy A, Murillo R, Weber M, Welzel K,     Vente A and Bechthold A. Biosynthesis of the terpene     phenalinolactone in Streptomyces sp. Tu6071: analysis of the gene     cluster and generation of derivatives. Chem Biol. 2006. V.     13:365-377. -   Eichhorn P and Aga D. Characterization of moe antibiotics from     medicated chicken feed by ion-trap mass spectrometry with     electrospray ionization. Rapid Commun Mass Spectrom. 2005. V.     19:2179-2186. -   Feng L, Tao J, Guo H, Xu J, Li Y, Rezwan F, Reeves P, Wang L.     Structure of the Shigella dysenteriae 7 O antigen gene cluster and     identification of its antigen specific genes. Microb Pathog. 2004     February; 36(2):109-15. -   Flett F, Mersinias V, Smith C P. High efficiency intergeneric     conjugal transfer of plasmid DNA from Escherichia coli to methyl     DNA-restricting streptomycetes. FEMS Microbiol Lett. 1997.     155:223-229. -   Garneau S, Qiao L, Chen L, Walker S, Vederas J C: Synthesis of mono-     and disaccharide analogs of moe and lipid II for inhibition of     transglycosylase activity of penicillin-binding protein 1b. Bioorg     Med Chem 2004, 12: 6473-6494. -   Genetika. 2001 October; 37(10):1340-7. Russian. -   Goldman R C, Baizman E R, Branstrom A A, Longley C B: Differential     antibacterial activity of moe analogues on gram-positive bacteria.     Bioorg Med Chem Lett 2000, 10: 2251-2254. -   Goldman, R. C., and Gange, D. (2000). Inhibition of     transglycosylation involved in bacterial peptidoglycan synthesis.     Curr. Med. Chem. 7, 801-820. -   Gromyko O M, Rebets YuV, Ostash B, Luzhetskyy A, Fukuhara M,     Bechthold A, Nakamura T and Fedorenko V. Generation of Streptomyces     globisporus SMY622 strain with increased landomycin E production and     it's initial characterization. J Antibiot. 2004. V. 57:383-389. -   Gust B, Challis G L, Fowler K, Kieser T, Chater K F. PCR-targeted     Streptomyces gene replacement identifies a protein domain needed for     biosynthesis of the sesquiterpene soil odor geosmin. Proc Natl Acad     Sci USA. 2003. 100:1541-1546. -   Halliday J, McKeveney D, Muldoon C, Rajaratnam P, Meutermans W.     Targeting the forgotten transglycosylases. Biochem Pharmacol. 2006     Mar. 30; 71(7):957-67. -   He H, Shen B, Korshalla J, Siegel M M and Carter G T. Isolation and     structural elucidation of AC326-α, a new member of the moe group. J     Antibiot. 2000. V. 53(2): 191-195. -   Heijenoort van J: Formation of glycan chains in the synthesis of     bacterial peptidoglycan. Glycobiol 2001, 11: 25R-36R. -   Hodgson, D. A. Primary metabolism and its control in streptomycetes:     a most unusual group of bacteria. (2000). Adv. Microb. Physiol. 42,     47-238. -   Hong H J, Paget M S, Buttner M J. A signal transduction system in     Streptomyces coelicolor that activates the expression of a putative     cell wall glycan operon in response to vancomycin and other cell     wall-specific antibiotics. Mol Microbiol. 2002 June;     44(5):1199-1211. -   Hong Y S, Lee D, Kim W, Jeong J K, Kim C G, Sohng J K, Lee J H, Paik     S G, Lee J J. Inactivation of the carbamoyltransferase gene refines     post-polyketide synthase modification steps in the biosynthesis of     the antitumor agent geldanamycin. J Am Chem Soc. 2004 Sep. 15;     126(36):11142-3. -   Hopwood D. Soil to genomics: the Streptomyces chromosome. Ann Rev     Microbiol. 2006. V. 40:1-23 (epub ahead of print) -   Ishikawa J, Hotta K. FramePlot: a new implementation of the Frame     analysis for the predicting protein-coding regions in the bacterial     DNA with a high G+C content. FEMS Microbiol Lett. 1999. V.     174:251-253. -   Iyobe S, Mitsuhashi S and Saito T. Sex pili mutants isolated by     macarbomycin treatment. Antimicrob Agents Chemother. 1973.     3(5):614-620. -   Jabbouri S, Fellay R, Talmont F, Kamalaprija P, Burger U, Relic B,     Prome J C, Broughton W J. Involvement of nodS in N-methylation and     nodU in 6-O-carbamoylation of Rhizobium sp. NGR234 nod factors. J     Biol Chem. 1995 Sep. 29; 270(39):22968-73. -   Kaplan J, Velliyagounder K, Ragunath C, Rohde H, Mack D, Knobloch J     K-M and Ramamsubbu N. Genes involved in the synthesis and     degradation of matrix polysaccharide in Actinobacillus     actinomycetemcomitans and Actinobacillus pleuropneumoniae biofilms.     J Bacteriol. 2004. V. 186:8213-8220. -   Kaur P. Expression and characterization of DrrA and DrrB proteins of     Streptomyces peucetius in Escherichia coli: DrrA is an ATP binding     protein. J Bacteriol. 1997 February; 179(3):569-75. -   Kawasaki T, Hamano Y, Kuzuyama T, Itoh N, Seto H and Dairi T.     Interconversion of the product specificity of type I eubacterial     farnesyl diphosphate synthase and geranylgeranyl diphosphate     synthase through one amino acid substitution. J Biochem. 2003. V.     133:83-91. -   Kawasaki T, Hayashi Y, Kuzuyama T, Furihata K, Itoh N, Seto H,     Dairi T. Biosynthesis of a natural polyketide-isoprenoid hybrid     compound, furaquinocin A: identification and heterologous expression     of the gene cluster. J Bacteriol. 2006. 188(4):1236-44. -   Kieser T, Bibb M J, Buttner M J, Chater K F and Hopwood D A.     Practical Streptomyces genetics. 2000. Norwich, England: The John     Innes Foundation. -   Knirel Y A, Dashunin V V, Shashkov A S, Kochetkov N K, Dmitriev B A,     Hofman I L. Somatic antigens of Shigella: structure of the     O-specific polysaccharide chain of the Shigella dysenteriae type 7     lipopolysaccharide. Carbohydrate Res. 1988. V. 179:51-60. -   Kudo F, Kawabe K, Kuriki H, Eguchi T, Kakinuma K. A new family of     glucose-1-phosphate/glucosamine-1-phosphate nucleotidylyltransferase     in the biosynthetic pathways for antibiotics. J Am Chem Soc. 2005     Feb. 16; 127(6):1711-8. -   Leskiw B K, Mah R, Lawlor E J and Chater K F. Accumulation     ofbldA-specified tRNA is temporally regulated in Streptomyces     coelicolor A3(2). J Bacteriol. 1993. V. 175:1995-2005. -   Leskiw, B. K., Lawlor, E J., Fernandez-Abalos, J. M., and     Chater, K. F. (1991). TTA codons in some genes prevent their     expression in a class of developmental, antibiotic-negative,     Streptomyces mutants. Proc. Natl. Acad. Sci. USA. 88, 2461-2465. -   Lin W S, Cunneen T, Lee C Y. Sequence analysis and molecular     characterization of genes required for the biosynthesis of type 1     capsular polysaccharide in Staphylococcus aureus. J Bacteriol. 1994     November; 176(22):7005-16. -   Lindner 1961 -   Liu H, Ritter T K, Sadamoto R, Sears P S, Wu M, Wong C H. Acceptor     specificity and inhibition of the bacterial cell-wall     glycosyltransferase MurG. Chembiochem. 2003.4:603-609. -   Lombo, Felipe, Brana A. F., Mendez, C and Salas J. A. The     mirithramycin gene cluster of Streptomyces argillaceus contains a     positive regulatory gene and two repeated DNA sequences that are     located at both ends of the cluster. J. Bacteriol. 1999 January;     181(2):642-647. -   Lovering, A. L., de Castro, L. H., Lim, D., and Strynadka, N. C.     (2007). Structural insight into the transglycosylation step of     bacterial cell-wall biosynthesis. Science. 315, 1402-1405. -   Luzhetskyy A, Fedoryshyn M, Durr C, Taguchi T, Novikov V and     Bechthold A. Iteratively acting glycosyltransferases involved in the     hexasaccharide biosynthesis of landomycin A. Chem Biol. 2005. V.     12:725-729. -   Luzhetskii A N, Ostash B E, Fedorenko V A. Interspecies conjugation     of Escherichia coli-Streptomyces globisporus 1912 using integrative     plasmid pSET152 and its derivatives] -   McAlpine J B, Bachmann B O, Piraee M, Tremblay S, Alarco A M,     Zazopoulos E, Farnet C M. Microbial genomics as a guide to drug     discovery and structural elucidation: ECO-02301, a novel antifungal     agent, as an example. J Nat Prod. 2005. 68(4):493-6. -   McKeegan K S, Borges-Walmsley M I and Walmsley A R. The structure     and function of drug pumps: an update. Trends Microbiol. 2003. V.     11(1):21-28. -   Men H, Park P, Ge M and Walker S. Substrate synthesis and activity     assay for MurG. J Am Chem Soc. 1998. V. 120:2484-2485. -   Mendez C, Salas J A. The role of ABC transporters in     antibiotic-producing organisms: drug secretion and resistance     mechanisms. Res Microbiol. 2001. 152(3-4):341-50. -   Meyers E, Smith D, Slusarchyk W A, Bouchard J L and Weisenbom F L.     The diumycins. New members of an antibiotic family having prolonged     in vivo activity. J Antibiot. 1969. V. 22:490-493. -   Murrell J M, Liu W, Shen B. Biochemical characterization of the     SgcA1 alpha-D-glucopyranosyl-1-phosphate thymidylyltransferase from     the enediyne antitumor antibiotic C-1027 biosynthetic pathway and     overexpression of sgcA1 in Streptomyces globisporus to improve     C-1027 production. J Nat Prod. 2004 February; 67(2):206-13. -   Muth G, Nussbaumer B, Wohlleben W and Puhler A. A vector system with     temperature-sensitive replication for gene disruption and mutational     cloning in streptomycetes. Mol Gen Genet. 1989. V. 219: 341-348. -   Nakagawa A, Wu T-S, Keller P J, Lee J P, Omura S, Floss H G.     Biosynthesis of asukamycin. Formation of the     2-amino-3-hydroxycyclopent-2-enone moiety. J Chem Soc Chem     Commun. 1985. P. 519-521. -   Nemoto N, Oshima T, Yamagishi A. Purification and characterization     of geranylgeranylglyceryl phosphate synthase from a     thermoacidophilic archaeon, Thermoplasma acidophilum. J Biochem     (Tokyo). 2003. 133(5):651-657. -   Neundorf I, Kohler C, Hennig L, Findeisen M, Arigoni D and Welzel P.     Evidence for the combined participation of a C₁₀ and a C₁₅ precursor     in the biosynthesis of moenocinol, the lipid part of the moe     antibiotics. ChemBioChem. 2003. V. 4:1201-1205. -   Oh S H, Chater K F. Denaturation of circular or linear DNA     facilitates targeted integrative transformation of Streptomyces     coelicolor A3(2): possible relevance to other organisms. J     Bacteriol. 1997. 179(1):122-127. -   Ostash, B., Saghatelian, A., and Walker, S. (2007). A streamlined     metabolic pathway for the biosynthesis of moenomycin a. Chem Biol.     14, 257-267. -   Ostash B, Walker S. Bacterial transglycosylase inhibitors. Current     Opin Chem Biol. 2005. 9:459-466. -   Pacholec M, Freel Meyers C L, Oberthur M, Kahne D, Walsh C T.     Characterization of the aminocoumarin ligase SimL from the     simocyclinone pathway and tandem incubation with NovM,P,N from the     novobiocin pathway. Biochemistry. 2005 Mar. 29; 44(12):4949-56. -   Paton A W and Paton J C. Molecular characterization of the locus     encoding biosynthesis of the lipopolysaccharide O antigen of     Escherichia coli serotype 0113. Infection Immun. 1999. V.     67(11):5930-5937. -   Petricek M, Petrickova K, Havlicek L and Felsberg J. Occurrence of     two 5-aminolevulinate biosynthetic pathways in Streptomyces nodosus     subsp. asukaensis is linked with the production of asukamycin. J.     Bacteriol. 2006. V. 188(14): 5113-5123. -   Pfaller M A. Flavophospholipol use in animals: Positive implications     for antimicrobial resistance based on its microbiologic properties.     Diagn Microbiol Infect Dis. 2006 May 12; [Epub ahead of print]. -   Rascher A, Hu Z, Viswanathan N, Schirmer A, Reid R, Nierman W C,     Lewis M, Hutchinson C R. Cloning and characterization of a gene     cluster for geldanamycin production in Streptomyces hygroscopicus     NRRL3602. FEMS Microbiol Lett. 2003. V. 218:223-230. -   Rebets YuV, Ostash B O, Fukuhara M, Nakamura M and Fedorenko V O.     Expression of the regulatory protein LndI for landomycin E     production in Streptomyces globirporus 1912 is controlled by the     availability of tRNA for the rare UUA codon. FEMS Microbiol     Lett. 2006. V. 256:30-37. -   Redenbach M, Flett F, Piendl W, Glocker I, Rauland U, Wafzig O,     Kliem R, Leblond P, Cullum J. The Streptomyces lividans 66     chromosome contains a 1 MB deletogenic region flanked by two     amplifiable regions. Mol Gen Genet. 1993. 241:255-262. -   Riedl S, Ohlsen K, Werner G, Witte W, Hacker J. Impact of     flavophospholipol and -   vancomycin on conjugational transfer of vancomycin resistance     plasmids. Antimicrob Agents Chemother. 2000. 44(11):3189-92. -   Sambrook J, Fritsch E F and Maniatis T. Molecular cloning, a     laboratory manual. 1989. Cold Spring Harbor Laboratory, Cold Spring     Harbor, N.Y. -   Sambrook, J., and Russel, D. W. (2001) Molecular Cloning: A     Laboratory Manual (Cold Spring Harbor Lab. Press, Cold Spring     Harbor, N.Y.), 3^(rd) Ed. -   Schuricht U, Endler K, Hennig L, Findeisen M and Welzel P. Studies     on the biosynthesis of the antibiotic moe A. J Prakt Chem. 2000. V.     342(8). P. 761-772. -   Schuricht U, Hennig L, Findeisen M, Endler K, Welzel P and     Arigoni D. The biosynthesis of moenocinol, the lipid part of the moe     antibiotics. Tetrahedron Lett. 2001. V. 42:3835-3837. -   Sekurova, O. N., Brautaset, T., Sletta, H., Borgos, S. E.,     Jakobsen, M. O. M., Ellingsen, T. E., Strom, A. R., Valla, S., and     Zotchev, S. B. (2004). In vivo analysis of the regulatory genes in     the nystatin biosynthetic gene cluster of Streptomyces noursei     ATCC11455 reveals their differential control over antibiotic     biosynthesis. J. Bacteriol. 186, 1345-1354. -   Slusarchyk W A, Weisenborn F L. The structure of the lipid portion     of the antibiotic prasinomycin. Tetrahedron Lett. 1969.8: 659-662. -   Smith H E, Veenbergen V, Velde J, Damman M, Wisselink and Smits M A.     The cps genes of Streptococcus suis serotypes 1, 2 and 9:     development of rapid serotype-specific PCR assays. J Clin     Microbiol. 1999. V. 37:3146-3152. -   Soderberg T, Chen A and Poulter C D. Geranylgeranylglycerylphosphate     synthase. -   Characterization of the recombinant enzyme from Methanobacterium     thermoautotrophicum. Biochemistry. 2001. V. 40:14847-14854. -   Sosio M, Stinchi S, Beltrametti F, Lazzarini A and Donadio S. The     gene cluster for the biosynthesis of the glycopeptide antibiotic     A40926 by Nomomuraea species. Chem Biol. 2003. V. 10:541-549. -   Subramaniam-Niehaus, B., Schneider, T., Metzger, J. W. &     Wohleben, W. (1997). Isolation and analysis of moenomycin and its     biosynthetic intermediates from Streptomyces ghanaensis (ATCC14672)     wildtype and selected mutants. Z. Naturforsch. 52, 217-226. -   Tachibana A, Yano Y, Otani S, Nomura N, Sako Y, Taniguchi M. Novel     prenyltransferase gene encoding farnesylgeranyl diphosphate synthase     from a hyperthermophilic archaeon, Aeropyrum pernix. Molecular     evolution with alteration in product specificity. Eur J Biochem.     2000 January; 267(2):321-8. -   Tahlan K, Park H U, Jensen S E. Three unlinked gene clusters are     involved in clavam metabolite biosynthesis in Streptomyces     clavuligerus. Can J Microbiol. 2004. 50(10):803-10. -   Takahashi H, Liu Y N, Liu H W. A two-stage one-pot enzymatic     synthesis of TDP-L-mycarose from thymidine and glucose-1-phosphate.     J Am Chem Soc. 2006. 128(5):1432-1433. -   Takahashi S, Okanishi A, Utahara R, Nitta K, Maeda K, Umezawa H.     Macarbomycin, a new antibiotic containing phosphorus. J     Antibiot. 1970. V. 23(1). P: 48-50. -   Taylor, J. G., Li, X., Oberthur, M., Zhu, W., and Kahne, D. E.     (2006). The total synthesis of moenomycin A. J. Am. Chem. Soc. 128,     15084-15085. -   Thuy T T, Lee H C, Kim C G, Heide L, Sohng J K. Functional     characterizations of novWUS involved in novobiocin biosynthesis from     Streptomyces spheroides. Arch Biochem Biophys. 2005. 436(1):161-7. -   Trepanier N K, Jensen S E, Alexander D C and Leskiw B K. The     positive activator of cephamycin C and clavulanic acid production in     Streptomyces clavuligerus is mistranslated in a bldA mutant.     Microbiology. 2002. V. 148: 643-656. -   Wallhausser K H, Nesermann G, Prave P and Steigler A. Moe, a new     antibiotic. I. Fermentation and isolation. Antimicrob Agents     Chemother. 1965. P. 734-736. -   Wang X, Preston III J F and Romeo T. The pgaABCD locus of     Escherichia coli promotes the synthesis of a polysaccharide adhesin     required for biofilm formation. J Bacteriol. 2004. V. 186:2724-2734. -   Weber T, Welzel K, Pelzer S, Vente A, Wohlleben W. Exploiting the     genetic potential of polyketide producing streptomycetes. J     Biotechnol. 2003. V. 106: 221-232. -   Weisenborn F L, Bouchard J L, Smith D, Pansy F, Maestrone G,     Miraglia G, Meyers E. The prasinomycins: antibiotics containing     phosphorus. Nature. 1967. V. 213: P. 1092-1094. -   Welzel P, Kunisch F, Kruggel F, Stein H, Scherkenbeck J, Hiltmann A,     Duddeck H, Muller D, Maggio J E, Fehlhaber H-W, Seibert G, van     Heijenoort Y and van Heijenoort J. Moe A: minimum structural     requirements for biological activity. Tetrahedron 1987. V.     43(3):585-598. -   Welzel P: Transglycosylase inhibition. In Antibiotics and antiviral     compounds—chemical synthesis and modification. Edited by Krohn K,     Kirst H and Maag H. VCH Weinheim, Germany, 1993: 373-378. Welzel, P.     (2005). -   Welzel P. Syntheses around the transglycosylation step in     peptidoglycan biosynthesis. Chem Rev. 2005. V. 105:4610-4660. -   Westrich L, Domann S, Faust B, Bedford D, Hopwood D A, Bechthold A.     Cloning and characterization of a gene cluster from Streptomyces     cyanogenus S136 probably involved in landomycin biosynthesis. FEMS     Microbiol Lett. 1999. 170:381-387. -   Wilson V T and Cundliffe E. Molecular analysis of tirB, an     antibiotic-resistance gene from tylosin-producing     Streptomycesfradiae, and discovery of a novel resistance mechanism.     J Antibiot. 1999. V. 52 P: 288-296. -   Yuan, Y., Barrett, D., Zhang, Y., Kahne, D., Sliz, P., and     Walker, S. (2007). Crystal structure of a peptidoglycan     glycosyltransferase suggests a model for processive glycan chain     synthesis. Proc. Natl. Acad. Sci. USA. 104, 5348-5353. -   Yu T-W, Bai L, Clade D, Hoffman D, Toelzer S, Trihn K Q, Xu J, Moss     S J, Leistner E and Floss H G. The biosynthetic gene cluster of the     maytansinoid antitumor agent ansamitocin from Actinosynnema     pretiosum. Proc Natl Acad Sci. 2002. V. 99(12):7968-7973. -   Zalkin H, Smith J L. Enzymes utilizing glutamine as an amide donor.     Adv Enzymol Relat Areas Mol Biol. 1998. V. 72:87-144. -   Zehl M, Pitternauer E, Rizzi A, Allmaier G. Characterization of moe     antibiotic complex by multistage MALDI-IT/RTOF-MS and ESI-IT-MS. J     Am Soc Mass Spectrom. 2006. V. 17:1081-1090. -   Zhang L, Radziejewska-Lebrecht J, Krajewska-Pietrasik D, Toivanen P     and Skurnik M. Molecular and chemical characterization of the     lipopolysaccharide O-antigen and its role in the virulence of     Yersinia enterocolitica serotype O:8. Mol Microbiol. 1997. V.     27(1):63-76. -   Zhu L, Ostash B, Rix U, Nur-E-Alam M, Mayers A, Luzhetskyy A, Mendez     C, Salas J A, Bechthold A, Fedorenko V, Rohr J. Identification of     the function of gene IndM2 encoding a bifunctional     oxygenase-reductase involved in the biosynthesis of the antitumor     antibiotic landomycin E by Streptomyces globisporus 1912 supports     the originally assigned structure for landomycinone. J Org     Chem. 2005. 70:631-638.

The contents of the aforementioned references are incorporated herein by reference in their entireties.

EQUIVALENTS

The present invention is not to be limited in terms of the particular embodiments described in this application, which are intended as single illustrations of individual aspects of the invention. Many modifications and variations of this invention can be made without departing from its spirit and scope, as will be apparent to those skilled in the art. Functionally equivalent methods and apparatuses within the scope of the invention, in addition to those enumerated herein, will be apparent to those skilled in the art from the foregoing descriptions. Such modifications and variations are intended to fall within the scope of the appended claims. The present invention is to be limited only by the terms of the appended claims, along with the full scope of equivalents to which such claims are entitled. 

1-11. (canceled)
 12. A method of synthesizing a moenomycin, a moenomycin derivative, or a moenomycin intermediate wholly or partially in vitro comprising: reacting a one or more moenomycin precursor, derivative and/or moenomycin intermediate with a one or more polypeptide selected from the group consisting of: moeA4, moeB4, moeC4, moeB5, moe A5, moeD5, moeJ5, moeE5, moeF5, moeH5, moeK5, moeM5, moeN5, moe05, moeX5, moeP5, moeR5, moeS5, moeGT1, moeGT2, moeGT3, moeGT4, and moeGT5, under conditions wherein the moenomycin, the moenomycin derivative, or the intermediate is wholly or partially synthesized.
 13. A method of modifying a moenomycin wholly or partially in vitro comprising: reacting a moenomycin, a moenomycin derivative or a moe intermediate with a one or more polypeptide selected from the group consisting of: moeA4, moeB4, moeC4, moeB5, moe A5, moeD5, moeJ5, moeE5, moeF5, moeH5, moeK5, moeM5, moeN5, moeO5, moeX5, moeP5, moeR5, moeS5, moeGT1, moeGT2, moeGT3, moeGT4, and moeGT5, under conditions wherein the moenomycin, the moenomycin derivative, or the moenomycin intermediate is modified. 14-15. (canceled)
 16. A moenomycin derivative having the structure:

wherein Ac is acetyl; R and R¹ independently are selected from the group consisting of hydroxyl, and —NHR² where R² is hydrogen, alkyl, cycloalkyl, or substituted cycloalkyl; X is hydrogen, or

R³ is selected from the group consisting of hydrogen and hydroxyl; and X¹ is selected from the group consisting of hydrogen,

R⁴ is selected from the group consisting of hydrogen and hydroxyl; R⁵ is selected from the group consisting of hydroxyl and —NHR⁶ where R⁶ is selected from the group consisting of hydrogen, alkyl, cycloalkyl, and substituted cycloalkyl, and R⁷ is hydrogen or methyl, or a pharmaceutically acceptable salt, tautomer, and/or ester thereof.
 17. The moenomycin derivative of claim 16, wherein R and R¹ independently are —NH₂.
 18. The moenomycin derivative of claim 16, wherein X is hydrogen or


19. (canceled)
 20. The moenomycin derivative of claim 16, wherein X¹ is hydrogen,

21-22. (canceled)
 23. The moenomycin derivative of claim 16, wherein the structure is:

R³ is selected from the group consisting of hydrogen and hydroxyl, or a pharmaceutically acceptable salt, tautomer, and/or ester thereof.
 24. The moenomycin derivative of claim 23, wherein R3 is hydrogen or hydroxyl.
 25. (canceled)
 26. The moenomycin derivative of claim 16, wherein the structure is:

R⁴ is selected from the group consisting of hydrogen and hydroxyl, or a pharmaceutically acceptable salt, tautomer, and/or ester thereof. 27-28. (canceled)
 29. The moenomycin derivative of claim 16, wherein the structure is:

R⁴ is selected from the group consisting of hydrogen and hydroxyl, or a pharmaceutically acceptable salt, tautomer, and/or ester thereof. 30-31. (canceled)
 32. The moenomycin derivative of claim 16, wherein the structure is:

R⁴ is selected from the group consisting of hydrogen and hydroxyl, or a pharmaceutically acceptable salt, tautomer, and/or ester thereof. 33-34. (canceled)
 35. The moenomycin derivative of claim 16, wherein the structure is:

R⁴ is selected from the group consisting of hydrogen and hydroxyl, PG^(P)-1⁵¹, or a pharmaceutically acceptable salt, tautomer, and/or ester thereof. 36-37. (canceled)
 38. The moenomycin derivative of claim 16, wherein the structure is:

wherein R⁴ is hydrogen or hydroxyl and R⁶ is selected from the group consisting of hydrogen, alkyl, cycloalkyl, and substituted cycloalkyl, or a pharmaceutically acceptable salt, tautomer, and/or ester thereof. 39-43. (canceled)
 44. A pharmaceutical composition comprising the moenomycin derivative of claim 16 and a pharmaceutically acceptable carrier.
 45. A moenomycin derivative having the structure:

wherein R⁷ and R⁸ independently are selected from the group consisting of hydroxyl and —NHR⁹ where R⁹ is selected from the group consisting of hydrogen, alkyl, cycloalkyl, and substituted cycloalkyl; and R¹⁰ is hydrogen or hydroxyl; or a pharmaceutically acceptable salt, tautomer, and/or ester thereof. 46-48. (canceled)
 49. A pharmaceutical composition comprising the moenomycin derivative of claim 45 and a pharmaceutically acceptable carrier.
 50. An isolated Streptomyces strain selected from the group consisting of: Streptomyces ghanaensis, Streptomyces ederensis, Streptomyces geysiriensis, and Streptomyces bambergiensis strain which carries a one or more mutant or inactivated genes, wherein the mutant or inactivated genes are selected from the group consisting of: moeA4, moeB4, moeC4, moeB5, moe A5, moeD5, moeJ5, moeE5, moeF5, moeH5, moeK5, moeM5, moeN5, moeO5, moeX5, moeP5, moeR5, moeS5, moeGT1, moeGT2, moeGT3, moeGT4, and moeGT5.
 51. The isolated Streptomyces strain of claim 50, wherein the Streptomyces ghanaensis strain is Streptomyces ghanaensis ATCC14627. 52-54. (canceled)
 55. The method according to claim 12, wherein the method further comprises reacting the moenomycin, moenomycin derivative and/or moenomycin intermediate with a one or more reactants selected from the group consisting of: UDP-sugars, prenyl-pyrophosphates, phosphoglycerate, amino acids, carbamoyl phosphate, ATP and biological cofactors.
 56. The method according to claim 13, wherein the method further comprises reacting the moenomycin, moenomycin derivative and/or moenomycin intermediate with a one or more reactants selected from the group consisting of: UDP-sugars, prenyl-pyrophosphates, phosphoglycerate, amino acids, carbamoyl phosphate, ATP and biological cofactors. 